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ON 
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spoliation 
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US-PGPUB; 
USPAT; 
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DERWENT 
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12 or 13 
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ON 
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176 
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ON 
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USOCR; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/06/07 14:27 
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17 with 15 
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USPAT; 
USOCR; 
EPO; JPO; 
DERWENT 


OR 


ON 


2006/06/07 14:28 



6/7/06 2:29:15 PM 



Page 1 



SCORE Search Results Details for Application 10510408 and Search Result us- 10-5 10-4... Page 1 of 17 



SCORE Search Results Details for Application 
10510408 and Search Result us-10-510-408- 
2,rup. 



Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Suggestions 

This page gives you Search Results detail for the Application 10510408 and Search Result us-10- 

510-408-2. rup. 

start 

Go Back to previous page 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - protein search, using sw model 

Run on: June 2, 2006, 20:25:57 ; Search time 142 Seconds 

(without alignments) 
1126.956 Million cell updates/sec 

Title: US-10-510-408-2 
Perfect score: 875 

Sequence: 1 MYSRSKFKIGLLLIGSLLAA HVI EAVKS S GS QHH VEDMKT 173 

Scoring table: BLOSUM62 

Gapop 10.0 > Gapext 0.5 

Searched: 2849598 seqs, 925015592 residues 

Total number of hits satisfying chosen parameters: 2849598 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_7 . 2 : * 

1: uniprot_sprot : * 
2 : uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 

1 875 100.0 173 2 Q65GP4_BACLD Q65gp4 bacillus li 

2 486.5 55.6 170 1 BOFC_BACSU 005391 bacillus su 

3 334 38.2 158 2 Q5KWQ8_GE0KA Q5kwq8 geobacillus 

4 281.5 32.2 176 2 Q730B0 BACC1 Q730b0 bacillus ce 
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280 . 5 


32 . 


1 


175 


2 


/"V A t Jtf^ O O T\ TV **** 

Q4MS2 8 BACCE 


Q4ms28 


bacillus ce 
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279.5 


31 . 


9 


176 
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Q634C1_BACCZ 


Q634cl 


bacillus ce 
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276.5 


31 . 


6 


176 
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Q3EJ53_BACTI 


Q3e j 53 


bacillus tn 


8 


276.5 


31 . 


6 


176 


2 


Q817W1_BACCR 


Q817wl 


bacillus ce 


9 


276.5 


31 . 


6 


176 


2 


^ OlT/^^T T^fc TV TV XT 

Qo 1 LG 6 BACAN 


Qollgo 


bacillus an 


10 


274.5 


31 . 


4 


176 


2 


/"it T"\ TV *"> T"* TV Tf t ,f 

Q6HDA3_BACHK 


Q6hda3 


bacillus tn 


11 


232 


26 . 


5 


178 


2 


^■V O TV f V "T ^ TV T T 

Q9KDJ0 BACHD 


Q9kdj 0 


bacillus ha 


12 


167 


19 . 


1 


172 


2 


Q5WHR7 BACSK 


Q5whr7 


bacillus cl 


13 


105 


12 . 


0 


190 


2 


Q2ZJH5 CALSA 


Q2zjn5 


caldicellul 


14 


99 


11 . 


3 


682 


2 


Q33NB2_METHU 


Q33nb2 


methanospir 


15 


98 . 5 


11 . 


3 


712 


1 


TBPB_NEIMB 


Q9k0v0 


neisseria m 


16 


98 . 5 


11 . 


3 


712 


2 


Q9JPI9 NEIME 


Q9 jpi9 


neisseria m 


17 


96 


11 . 


0 


357 


2 


Q7NNF5_GLOVT 


Q7nnf 5 


gloeobacter 


18 


95 


10 . 


9 


212 


2 


Q2SS5 6 MYCCA 


Q2ssb6 


mycoplasma 


19 


93 . 5 


10 . 


7 


401 


2 


Q2ZFS4_CALSA 


Q2zf s4 


caldicellul 


20 


92 . 5 


10 . 


6 


318 


2 


Q9ZDI9 RICPR 


Q9zdi9 


rickettsia 


21 


91 . 5 


10 . 


5 


263 


2 


Q5KSU5_FUGRU 


Q5ksu5 


fugu rubrip 


22 


91 


10 . 


4 


614 


2 


05118 7_B0RBU 


051187 


borrelia bu 


23 


91 


10 . 


4 


1040 


2 


Q4EJ2 3_LISM0 


Q4e j 23 


listeria mo 


24 


89 . 5 


10. 


2 


709 


2 


Q2SFX8_9GAMM 


Q2sfx8 


hahella che 


25 


89 


10 . 


2 


276 


2 


/•N f T^^l XT TV T* TV T"V 

Q 6 F7 N 6_AC IAD 


Qot /no 


acinetobact 


26 


88 . 5 


10 . 


1 


318 


2 


Q68X33 RICTY 


QooXjj 


rickettsia 


27 


87 . 5 


10 . 


0 


435 


2 


Q5NVF7_PONPY 


Q5nvf 7 


pongo pygma 


28 


87 . 5 


10. 


0 


704 


2 


Q40PH8_DESAC 


Q40ph8 


desulf uromo 


29 


87 . 5 


10 . 


0 


757 


2 


Q39Z9 6_GEOMG 


Q39z96 


geobacter m 


30 


87 


9 . 


9 


309 


2 


Q37 9K2_RHOPA 


Q379k2 


rhodopseudo 


31 


87 


9 . 


9 


340 


2 


Q5NNW0_ZYMMO 


Q5nnw0 


zymomonas m 


32 


87 


9. 


9 


802 


2 


Q60LR3_CAEBR 


Q601r3 


caenorhabdi 


33 


87 


9 . 


9 


1172 


1 


SMC2_SCHP0 


P41003 


schizosacch 


34 


86 . 5 


9 . 


9 


425 


1 


TIG_BACAN 


Q811b8 


bacillus an 


35 


86 . 5 


9 . 


9 


425 


1 


TIG BACC1 


Q72 zv3 


bacillus ce 


36 


86. 5 


9 . 


9 


425 


1 


TIG BACCR 


Q812q9 


bacillus ce 


37 


86.5 


9 . 


9 


425 


1 


TIG BACCZ 


Q633X1 


bacillus ce 


38 


86.5 


9 . 


9 


425 


1 


TIG BACHK 


Qonaoo 


oaciiius tn 


39 


8 6.5 


9 . 


9 


425 


2 


Q3EVC1 BACTI 


Q3evcl 


bacillus th 


40 


8 6.5 


9 . 


9 


425 


2 


Q4MS / 8 BACCE 




V-\ ^ /—* •{ linn a 

Daciiius ce 


41 


86.5 


9. 


9 


433 


2 


Q3TWV4_MOUSE 


Q3twv4 


mus musculu 


42 


86.5 


9. 


9 


436 


2 


Q6A0C9__MOUSE 


Q6a0c9 


mus musculu 


43 


86 


9. 


8 


507 


2 


Q8WQ14_OSTOS 


Q8wql4 


ostertagia 


44 


86 


9. 


8 


1329 


2 


Q8YYT7_ANASP 


Q8yyt7 


anabaena sp 


45 


85.5 


9. 


8 


322 


2 


Q3UCW9_MOUSE 


Q3ucw9 


mus musculu 



ALIGNMENTS 



RESULT 1 
Q65GP4_BACLD 

ID Q65GP4_BACLD PRELIMINARY; PRT; 173 AA. 

AC Q65GP4; Q62S53; 

DT 25-OCT-2004, integrated into UniProtKB/TrEMBL . 

DT 25-OCT-2004, sequence version 1. 

DT 07-FEB-2006, entry version 13. 

DE BofC. 

GN Name=bo f C ; Order edLocusNames=BL01 14 5 , BLi 02 9 02 / 

OS Bacillus lichenif ormis (strain DSM 13 / ATCC 14580). 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBIJTaxID=27 9010; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RX PubMed=15383718; DOI=10 . 1159/000079829; 
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RA Veith B. , Herzberg C, Steckel S., Feesche J., Maurer K.H., 

RA Ehrenreich P., Baeumer S., Henne A. , Liesegang H., Merkl R., 

RA Ehrenreich A., Gottschalk G. ; 

RT "The complete genome sequence of Bacillus lichenif ormis DSM13 , an 

RT organism with great industrial potential."; 

RL J. Mol. Microbiol. Biotechnol. 7:204-211(2004). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RX PubMed=l 54 61803; DOI=10 . 1186/gb-2004-5-10-r77 ; 

RA Rey M.W. , Ramaiya P., Nelson B.A., Brody-Karpin S.D., Zaretsky E.J., 

RA Tang M. , Lopez de Leon A., Xiang H., Gusti V., Clausen I.G., 

RA Olsen P.B., Rasmussen M.D., Andersen J.T., Joergensen P.L., 

RA Larsen T.S., Sorokin A., Bolotin A., Lapidus A., Galleron N., 

RA Ehrlich S.D., Berka R.M.; 

RT "Complete genome sequence of the industrial bacterium Bacillus 

RT lichenif ormis and comparisons with closely related Bacillus species."; 

RL Genome Biol. 5 :RESEARCH077 . 1-RESEARCH077 . 12 (2004 ) . 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AE017333; AAU41770.1; -; Genomic_DNA. 

DR EMBL; CP000002; AAU24407.1; -; Genomic_DNA. 

KW Complete proteome. 

SQ SEQUENCE 173 AA; 19539 MW; 00319BEEFBB2CFAC CRC64; 

Query Match 100.0%; Score 875; DB 2; Length 173; 

Best Local Similarity 100.0%; Pred. No. 6e-66; 

Matches 17 3; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MYSRSKFKIGLLLIGSLLAALSFHLEALAEKPAKVQIQLEKVYLDGDVGIENKVEAARTL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MYSRSKFKIGLLLIGSLLAALSFHLEALAEKPAKVQIQLEKVYLDGDVGIENKVEAARTL 60 

Qy 61 EDFKAAYKGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEP 120 

I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 EDFKAAYKGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEP 120 

Qy 121 IQS FFQIDIKRLESRMADDLRKGI PYRTKKEFEHVIEAVKSSGSQHHVEDMKT 17 3 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 IQS FFQIDIKRLESRMADDLRKGI PYRTKKEFEHVIEAVKSSGSQHHVEDMKT 173 



RESULT 2 
BOFC_BACSU 

ID BOFC_BACSU STANDARD; PRT; 170 AA. 

AC 005391; 

DT 30-MAY-2000, integrated into UniProtKB/Swiss-Prot . 

DT 01-JUL-1997, sequence version 1. 

DT 07-MAR-2006, entry version 37. 

DE Protein bofC precursor ( Bypass-of-f orespore protein C) . 

GN Name=bof C; OrderedLocusNames=BSU2 7750; 

OS Bacillus subtilis. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=1423; 
RN [1] 

RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] . 

RC STRAIN=168 / PY7 9; 

RX MEDLINE=9717 7783; PubMed=902 5289; 

RA Gomez M. , Cutting S.M.; 

RT "BofC encodes a putative forespore regulator of the Bacillus subtilis 
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RT sigma K checkpoint."; 

RL Microbiology 143:157-170(1997). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [GENOMIC DNA] . 

RC STRAIN=168; 

RA Tosato V., Bolotin A., Bertani I., Valentino I., Bruschi C.V.; 

RT "A 17.8 kb segment in the spoVB-nadC region of the Bacillus subtilis 

RT 168 chromosome: sequencing and ruv operon identification."; 

RL Submitted (DEC-1997) to the EMBL/ GenBank/DDB J databases. 

RN [3] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=168; 

RX MEDLINE=98044033; PubMed=9384377 ; DOI=10 . 1038/36786; 

RA Kunst F., Ogasawara N., Moszer I., Albertini A.M., Alloni G., 

RA Azevedo V., Bertero M.G., Bessieres P., Bolotin A., Borchert S., 

RA Borriss R., Boursier L., Brans A., Braun M., Brignell S.C., Bron S., 

RA Brouillet S., Bruschi C.V., Caldwell B., Capuano V., Carter N.M., 

RA Choi S.-K., Codani J. -J., Connerton I.F., Cummings N.J., Daniel R.A. , 

RA Denizot F. , Devine K.M., Duesterhoeft A., Ehrlich S.D., Emmerson P.T., 

RA Entian K.-D., Errington J., Fabret C, Ferrari E., Foulger D., 

RA . Fritz C, Fujita M. , Fujita Y., Fuma S., Galizzi A., Galleron N., 

RA Ghim S.-Y., Glaser P., Goffeau A., Golightly E.J., Grandi G., 

RA Guiseppi G., Guy B.J., Haga K., Haiech J., Harwood C.R., Henaut A., 

RA Hilbert H., Holsappel S., Hosono S., Hullo M.-F., Itaya M. , 

RA Jones L.-M., Joris B., Karamata D., Kasahara Y . , Klaerr-Blanchard M. , 

RA Klein C, Kobayashi Y., Koetter P., Koningstein G., Krogh S., 

RA Kumano M . , Kurita K. , Lapidus A., Lardinois S., Lauber J., 

RA Lazarevic V., Lee S.-M., Levine A., Liu H., Masuda S., Mauel C, 

RA Medigue C, Medina N . , Mellado R.P., Mizuno M., Moestl D., Nakai S., 

RA Noback M., Noone D., O'Reilly M. , Ogawa K., Ogiwara A., Oudega B., 

RA Park S.-H., Parro V., Pohl T.M., Portetelle D., Porwollik S., 

RA Prescott A.M., Presecan E., Pujic P., Purnelle B. , Rapoport G., 

RA Rey M., Reynolds S., Rieger M. , Rivolta C, Rocha E., Roche B., 

RA Rose M., Sadaie Y., Sato T., Scanlan E., Schleich S., Schroeter R., 

RA Scoffone F., Sekiguchi J., Sekowska A., Seror S.J., Serror P., 

RA Shin B.-S., Soldo B., Sorokin A., Tacconi E., Takagi T., Takahashi H., 

RA Takemaru K., Takeuchi M. , Tamakoshi A., Tanaka T., Terpstra P., 

RA Tognoni A., Tosato V., Uchiyama S., Vandenbol M. , Vannier F., 

RA Vassarotti A., Viari A., Wambutt R., Wedler E., Wedler H., 

RA Weitzenegger T., Winters P., Wipat A., Yamamoto H., Yamane K. , 

RA Yasumoto K., Yata K. , Yoshida K. , Yoshikawa H.-F., Zumstein E., 

RA Yoshikawa H., Danchin A. ; 

RT "The complete genome sequence of the Gram-positive bacterium Bacillus 

RT subtilis."; 

RL Nature 390:249-256(1997). 

RN [4] 

RP STRUCTURE BY NMR OF 31-170, AND SUBUNIT . 

RX PubMed=16049010; DOI=10 . 1074/ jbc . M506910200; 

RA Patterson H.M., Brannigan J. A., Cutting S.M., Wilson K.S., 

RA Wilkinson A. J., Ab E., Diercks T., de Jong R.N., Truf fault V., 

RA Folkers G.E., Kaptein R.; 

RT "The structure of bypass of forespore C, an intercompartmental 

RT signaling factor during sporulation in Bacillus."; 

RL J. Biol. Chem. 2 80:3 6214-36220(2005). 

CC -!- FUNCTION: Inhibits the spoIVB zymogen from undergoing 

CC autocatalytic activation by an unknown mechanism, and in this way 

CC plays a role in the sigma-K checkpoint of sporulation. 

CC -!- SUBUNIT: Monomer. 

CC -!- SUBCELLULAR LOCATION: Forespore intermembrane space. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

http://es/ScoreAccessWeb/GetItem.action?AppId=10510408&seqId=530059&Item 6/7/06 



SCORE Search Results Details for Application 10510408 and Search Result us- 10-5 10-4... Page 5 of 17 



cc 


Distributed under 


the Creative Commons Attribution-NoDerivs License 


cc 






















UK 


EMBL; X93081; CAA63621. 


l; 


f benoinic uwa. 


UK 


EMBL; Y15896; CAB75329. 


1; 


- ; Genomic_DNA . 


UK 


EMBL; Z99118; CAB14735. 


1/ 


- ; Genomic_DNA . 


UK 


PIR; A69596; A69596. 






DR 


PDB; 2BW2; 


NMR; A= 


31-170. 




DR 


GenomeRe views; AL00912 6 GR 


.; BSU27750. 


DR 


SubtiList; 


BG11917; bofC. 




DR 


BioCyc; BSUB1423 : BSU27 71-MONOMER; 


KW 


3D-structure; Complete 


proteome; Signal; Sporulation. 


FT 


SIGNAL 


1 


30 




Probable . 


FT 


CHAIN 


31 


170 




Protein bofC. 


FT 










/FTIa=PRO 0000020824 . 


FT 


VARIANT 


96 


96 




S -> F (in bof CI ) . 


FT 


STRAND 


34 


48 






FT 


STRAND 


50 


52 






FT 


STRAND 


54 


65 






FT 


HELIX 


66 


72 






FT 


STRAND 


73 


73 






FT 


TURN 


74 


75 






FT 


STRAND 


76 


82 






FT 


TURN 


83 


84 






FT 


STRAND 


85 


91 






FT 


STRAND 


93 


94 






FT 


TURN 


97 


97 






FT 


HELIX 


98 


100 






FT 


TURN 


101 


101 






FT 


STRAND 


105 


108 






FT 


TURN 


109 


111 






FT 


STRAND 


112 


119 






FT 


STRAND 


121 


121 






FT 


TURN 


122 


123 






FT 


STRAND 


124 


124 






FT 


STRAND 


126 


127 






£ 1 


TURN 


135 


137 






FT 


STRAND 


138 


138 






FT 


HELIX 


140 


148 






FT 


STRAND 


151 


151 






FT 


HELIX 


155 


168 






SQ 


SEQUENCE 


170 AA; 


19296 


MW; 2786FEE1DBDFD2A2 CRC64; 



Query Match 55.6%; Score 48 6.5; DB 1; Length 170; 

Best Local Similarity 58.8%; Pred. No. 4.7e-33; 

Matches 97; Conservative 26; Mismatches 33; Indels 9; Gaps 



1; 



Qy 

Db 

Qy 

Db 

Qy 

Db 



6 KFKIGLLLIGSLLAALSFHLEALAE- KPAKVQIQLEKVYLDGDVGIENKVEA 56 

: I I I : I I : I I : I : : I : I : I I I I I I I I I I I I I : I I 

3 RFSTAYLLLGILCSAAVFLIGAPSRALGAEVEHYEPLQVHVQLEKVYLDGDVSIEHKHEK 62 

57 ARTLEDFKAAYKGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGI 116 

: : : I I III II I : : I I I I : : I I I I I : I I I I I I I I I I I I I I : : : I II I I I I I I I 
63 VFSMDD FWAAYAGWT LVEQKKGYVLFRKQMDD I S PLSKVNGY I GVS DNGVI S T FHGRPE P 122 

117 LSEPIQSFFQIDIKRLESRMADDLRKGIPYRTKKEFEHVIEAVKS 161 ' 

I I I I I I I I I I I : : I I I I I : I I I I I : I I I I I I III : I : 
123 ASEPIQSFFQIDLERLESHMQKNLLKGIPFRTKAEFEDVIEHMKT 167 



RESULT 3 
Q5KWQ8 GEOKA 
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ID Q5KWQ8_GEOKA PRELIMINARY; PRT; 158 AA. 

AC Q5KWQ8; 

DT 01-FEB-2005, integrated into UniProtKB/TrEMBL . 

DT 01-FEB-2005, sequence version 1. 

DT 07-FEB-2006, entry version 5. 

DE Bypass-of-f orespore protein C (Forespore regulator of the sigma-K 

DE checkpoint) . 

GN OrderedLocusNames=GK2 593; 

OS Geobacillus kaus tophilus . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Geobacillus. 

OX NCBI_TaxID=l 4 62 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=HTA42 6; 

RX PubMed=15576355; DOI=10 . 1093/nar/gkh970; 

RA Takami H., Takaki Y., Chee G.-J., Nishi S., Shimamura S., Suzuki H., 

RA Matsui S . , Uchiyama I . ; 

RT "Thermoadaptation trait revealed by the genome sequence of 

RT thermophilic Geobacillus kaustophilus . " ; 

RL Nucleic Acids Res. 32:6292-6303(2004). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BA000043; BAD7 687 8.1; -; Genomic_DNA. 

KW Complete proteome . 

SQ SEQUENCE 158 AA; 18254 MW; C5CB88B037E88C3 6 CRC64; 

Query Match 38.2%; Score 334; DB 2; Length 158; 
Best Local Similarity 45.7%; Pred. No. 3.6e-20; 

Matches 64; Conservative 30; Mismatches 46; Indels 0; Gaps 0; 

Qy 18 IJ^LSFHLEALAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKK 77 

: I I : : : : I I : I I I : I I I I : : I I : : : I : I I I I : 

Db 10 IAAIVLPVHSASAAPVKMTIVLERQYLDGEMSEEKVTETVDSMTEIWKKYRGWQLVTLDD 69 

Qy 78 GFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMA 137 

I : I I I : : I I I I I I I I I I I : I : I I : I I : I : I I I I I I I I I I I I ::: I I I I 

Db 70 QTIVFRKTINDISPLLKTNGYFGITDDGTLSIFNGKPGRSSEIIQSFFQIDVQKLESRQQ 129 

Qy 138 DDLRKGIPYRTKKEFEHVIE 157 

: I : I I I : I : : I I I I 

Db 130 EKLKKGI RVLS KERYEQVI E 149 



RESULT 4 
Q730B0_BACC1 

ID Q730B0_BACC1 PRELIMINARY; PRT; 176 AA. 

AC Q730B0; 

DT 05-JUL-2004, integrated into UniProtKB/TrEMBL. 

DT 05-JUL-2004 , sequence version 1. 

DT 07-FEB-2006, entry version 9. 

DE Bypass-of-f orespore protein C, putative. 

GN ORFName s =BCE_4 507; 

OS Bacillus cereus (strain ATCC 10987). 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 

OC Bacillus cereus group. 

OX NCBI_TaxID=22252 3; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RX PubMed=14960714; DOI=10 . 1093/nar/gkh258 ; 
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RA Rasko D.A., Ravel J., Oekstad O.A., Helgason E., Cer R.Z., Jiang L., 

RA Shores K.A., Fouts D.E., Tourasse N.J., Angiuoli S.V., Kolonay J.F., 

RA Nelson W.C., Kolstoe A.-B., Fraser CM., Read T.D.; 

RT "The genome sequence of Bacillus cereus ATCC 10987 reveals metabolic 

RT adaptations and a large plasmid related to Bacillus anthracis pXOl."; 

RL Nucleic Acids Res. 32:977-988(2004). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AE017194; AAS43408.1; -; Genomic_DNA. 

KW Complete proteome. 

SQ SEQUENCE 176 AA; 20490 MW; B4 7F265C88E5D950 CRC64; 



Query Match 32.2%; Score 281.5; DB 2; Length 176; 

Best Local Similarity 43.7%; Pred. No. l.le-15; 

Matches 62; Conservative 25; Mismatches 54; Indels 1; Gaps 1; 

Qy 28 IAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQVD 87 

: I I : I I I I : : I : I I : I I I III I I I I I : I : I : : I : I I 
Db 32 VTEKDPQVTILLERKYWGEVSEEIFTEKVADLEKFLQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

Mill 11:11 ll:|:|:: III I MlllhCIII |::|| : 

Db 92 DISPLLKTSGYFGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERAKLKRGIRIK 150 

Qy 148 TKKE FEHVI EAVKS S GS QHHVE 169 

: I : I I I : I I : I 
Db 151 S KEGFVKT I EKMKQYAVQNRKE 172 



RESULT 5 
Q4MS28 BACCE 



ID Q4MS2 8_BACCE PRELIMINARY; PRT; 17 5 AA. 

AC Q4MS28; 

DT 02-AUG-2005, integrated into UniProtKB/TrEMBL . 

DT 02-AUG-2005, sequence version 1. 

DT 07-FEB-2006, entry version 2. 

DE BofC protein. 

GN ORFNames=BCE_G92 4 1_4 47 9; 

OS Bacillus cereus G9241. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 

OC Bacillus cereus group. 

OX NCBIJTax ID=2 69801; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=G9241; 

RX PubMed-1 5155910; DOI=10 . 1073/pnas . 04 02414 101; 

RA Hoffmaster A.R., Ravel J., Rasko D.A., Chapman G.D., Chute M.D., 

RA Marston C.K., De B.K., Sacchi C.T., Fitzgerald C, Mayer L.W., 

RA Maiden M.C.J. , Priest F.G., Barker M. , Jiang L., Cer R.Z., 

RA Rilstone J., Peterson S.N., Weyant R.S., Galloway D.R., Read T.D., 

RA Popovic T., Fraser CM.; 

RT "Identification of anthrax toxin genes in a Bacillus cereus associated 

RT with an illness resembling inhalation anthrax."; 

RL Proc. Natl. Acad. Sci. U.S.A. 101:8449-8454(2004). 

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is 

CC preliminary data. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 
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CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AAEK01000010 / EAL14975.1; -; Genomic_DNA. 

SQ SEQUENCE 175 AA; 20211 MW; 4056A6CA492FABE7 CRC64; 

Query Match 32.1%; Score 280.5; DB 2; Length 175; 

Best Local Similarity 43.4%; Pred. No. 1.4e-15; 

Matches 63; Conservative 27; Mismatches 50; Indels 5; Gaps 2; 

Qy 28 IJuliKPAKVQIQLEKVYLDGDVGIENKVlIAARTLEDFKAAYKGWQLIDQKKGFILFRKQVT) 87 

: I i : I I I I : : I : I I : I I I III I I I I I : I : I : : I : I I 
Db 32 VTEKEPEVTILLERMYVDGEVSEEIFTEKVADLEKFLQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

Mill 11:11 III I llllll:|:lll |::|| : 

Db 92 DISPLLKTSGY FGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERAKLKRGIRIK 150 

Qy 148 TKKEFEHVIEAVKSSGSQHHVEDMK 172 

: I : I II : I I : I : : I 
Db 151 SKEGFVKTIEKMK QYAVQNKK 171 



RESULT 6 
Q634C1 BACCZ 



ID 


Q634C1__BACCZ PRELIMINARY; PRT; 176 AA. 




AC 


Q634C1; 




DT 


25-OCT-2004, integrated into UniProtKB/TrEMBL . 




DT 


25-OCT-2004, sequence version 1. 




DT 


07-FEB-2006, entry version 10. 




DE 


Probable bypass-of-f orespore protein C. 




GN 


Name=bof C; OrderedLocusNames=BCE33L4167; 




OS 


Bacillus cereus (strain ZK / E33L) . 




oc 


Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 


oc 


Bacillus cereus group. 




ox 


NCBI TaxID=288681; 




RN 


[1] 




RP 


NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 




RA 


Brettin T.S., Bruce D., Challacombe J.F., Gilna 


P., Han C, Hill K., 


RA 


Hitchcock P., Jackson P., Keim P., Longmire J., 


Lucas S., Okinaka R., 


RA 


Richardson P., Rubin E., Tice H.; 




RT 


"Complete genome sequence of Bacillus cereus ZK 




RL 
CC 
CC 


Submitted (JUL-2004) to the EMBL/ GenBank/DDB J databases. 


Copyrighted by the UniProt Consortium, see http 


: //www. uniprot.org/terms 


CC 
CC 
DR 


Distributed under the Creative Commons Attribution-NoDerivs License 


EMBL; CP000001; AAU16102.1; -; Genomic_DNA. 




KW 


Complete proteome. 




SQ 


SEQUENCE 176 AA; 20474 MW; AEAE 4 6 A6CA4 1 BE 3 4 


CRC64; 



Query Match 31.9%; Score 279.5; DB 2; Length 176; 

Best Local Similarity 43.4%; Pred. No. 1.7e-15; 

Matches 63; Conservative 27; Mismatches 50; Indels 5; Gaps 2; 

Qy 28 LAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQVD 87 

: I I : I I I I : : I : I I : I I I III I I I I I : I : I : : I : I I 
Db 32 VTEKEPEVTILLERMYVDGEVSEEIFTEKVADLEKFLQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

I I I I I I I : I I I I : I : I : : III I I I I I I I : I : I I I I :: I I : 

Db 92 DISPLLKTSGY FGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERVKLKRGIRIK 150 



http://es/ScoreAccessWeb/GetItem.action?AppId=10510408&seqId=530059&ItemNam 6/7/06 



SCORE Search Results Details for Application 10510408 and Search Result us-10-5 10-4... Page 9 of 17 



Qy 148 TKKEFEHVIEAVKSSGSQHHVEDMK 172 

: I : I I I : I I : I : : I 
Db 151 SKEGEVKTIEKMK QYAVQNKK 171 

RESULT 7 
Q3EJ53_BACTI 

ID Q3EJ53_BACTI PRELIMINARY; PRT; 176 AA. 

AC Q3EJ53; 

DT 08-NOV-2005, integrated into UniProtKB/TrEMBL . 

DT 08-NOV-2005, sequence version 1. 

DT 07-FEB-2006, entry version 3. 

DE BofC protein. 

GN ORFNames=RBTH_01388; 

OS Bacillus thuringiensis serovar israelensis ATCC 35646. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 

OC Bacillus cereus group. 

OX NCBI_TaxID=339854; 

RN [1] 

RP NUCLEOTIDE SEQUENCE . 

RC STRAIN=ATCC 3564 6; 

RA Anderson I., Sorokin A., Kapatral V., Reznik G., Bhattacharya A., 

RA Mikhailova N., Burd H., Joukov V., Kaznadzey D., Walunas T., 

RA D'Souza M. , Larsen N., Pusch G., Liolios K., Grechkin Y . , Lapidus A., 

RA Goltsman E., Chu L., Fonstein M. , Ehrlich D., Overbeek R., 

RA Kyrpides N . , Ivanova N.; 

RT "Comparative genome analysis of Bacillus cereus group genomes with 

RT Bacillus subtilis."; 

RL Submitted (SEP-2005) to the EMBL/GenBank/DDBJ databases. 

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is 

CC preliminary data. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AAJM010008 38 ; EA051343.1; -; Genomic_DNA. 

SQ SEQUENCE 176 AA; 20535 MW; 74 4FD0FDEF8EB665 CRC64; 

Query Match 31.6%; Score 276.5; DB 2; Length 176; 

Best Local Similarity 42.8%; Pred. No. 3e-15; 

Matches 62; Conservative 28; Mismatches 50; Indels 5; Gaps 2; 

Qy 28 LAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQVD 87 

: : I : I I I I : : I : I I : I I I I I I I I I I I : I : I : : I : I I 
Db 32 VTKKEPEVTILLERMYVDGEVSEEILTEKWDLEKFLQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

I I I I I I I : I I I I : I : I : : III I I I I I I I : I : I I I I : : I I : 

Db 92 DISPLLKTSGYFGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERAKLKRGIRIK 150 

Qy 148 TKKEFEHVIEAVKSSGSQHHVEDMK 172 

:|: I II :| I: |:: I 
Db 151 SKEGFVRTIEKMK QYAVQNKK 171 

RESULT 8 
Q817W1_BACCR 

ID Q817W1_BACCR PRELIMINARY; PRT; 17 6 AA. 
AC Q817W1; 
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DT 01-JUN-2003, integrated into UniProtKB/TrEMBL . 

DT 01-JUN-2003, sequence version 1. 

DT 07-FEB-2006, entry version 10. 

DE BofC protein. 

GN OrderedLocusNames=BC4417; ORFNames=BC_4417 ; 

OS Bacillus cereus (strain ATCC 14579 / DSM 31) . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 

OC Bacillus cereus group. 

OX NCBI_TaxID=226900; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RX MEDLINE=22608415; PubMed=12721630; DOI=10 . 1038/nature01582 ; 

RA Ivanova N., Sorokin A., Anderson I., Galleron N., Candelon B., 

RA Kapatral V., Bhattacharyya A., Reznik G., Mikhailova N., Lapidus A., 

RA Chu L. , Mazur M. , Goltsman E., Larsen N. , D'Souza M. , Walunas T., 

RA Grechkin Y., Pusch G., Haselkorn R., Fonstein M. , Ehrlich S.D., 

RA Overbeek R., Kyrpides N.C.; 

RT "Genome sequence of Bacillus cereus and comparative analysis with 

RT Bacillus anthracis . " ; 

RL Nature 423:87-91(2003). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AE016877; AAP11330.1; -; Genomic_DNA. 

DR BioCyc; BCER226900 : BC4417-MONOMER; -. 

KW Complete proteome. 

SQ SEQUENCE 176 AA; 20442 MW; 7 642 688DF1C20982 CRC64; 



Query Match 31.6%; Score 276.5; DB 2; Length 176; 

Best Local Similarity 42.8%; Pred. No. 3e-15; 

Matches 62; Conservative 27; Mismatches 51; Indels 5; Gaps 2; 

Qy 28 LAE KPAKVQ I Q LE KV Y LDGDVG I ENKVEAART LED FKAAYKGWQ L I DQ KKG F I L FRKQVD 87 

: :| : I I ||::|:||:| I I III II lll:|: I: :|:M 
Db 32 VTKKEPEVTILLERMYVDGEVSEEILTEKVTDLEKFLQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

Mill 11:11 ll:|:|:: III I llllll:|:IM |::|| : 

Db 92 DISPLLKTSGYFGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERAKLKRGIRIK 150 



Qy 148 TKKEFEHVIEAVKSSGSQHHVEDMK 172 

: I : I I I : I I : I : I 
Db 151 SKEGFVKTIEKMK QYAVQSKK 171 



RESULT 9 

Q 8 1 LG 6_BACAN 

ID Q 8 1 L G 6_B AC AN PRELIMINARY; PRT; 176 AA. 

AC Q81LG6; Q6HSX3; Q6KM61; 

DT 01-JUN-2003, integrated into UniProtKB/TrEMBL. 

DT 01-JUN-2003, sequence version 1. 

DT 07-FEB-2006, entry version 15. 

DE Bypass-of-forespore protein C, putative. 

GN OrderedLocusNames=BA4 653 / BAS4318, GBAA4653; ORFNames=BA_4653 ; 

OS Bacillus anthracis. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 

OC Bacillus cereus group. 

OX NCBI_TaxID=1392; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 
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RC STRAIN=Ames / isolate Porton; 

RX MEDLINE=22608414; PubMed=12721629 ; DOI=10 . 1038/nature0158 6; 

RA Read T.D., Peterson S.N., Tourasse N.J., Baillie L.W., Paulsen I.T., 

RA Nelson K.E., Tettelin H., Fouts D.E., Eisen J. A., Gill S.R., 

RA Holtzapple E.K., Okstad O.A., Helgason E . , Rilstone J . , Wu M. , 

RA Kolonay J.F., Beanan M.J., Dodson R.J., Brinkac L.M., Gwinn M.L., 

RA DeBoy R.T., Madpu R., Daugherty S.C., Durkin A.S., Haft D.H., 

RA Nelson W.C., Peterson J.D., Pop M. , Khouri H.M., Radune D., 

RA Benton J.L., Mahamoud Y., Jiang L., Hance I.R., Weidman J.F., 

RA Berry K.J., Plaut R.D., Wolf A.M., Watkins K.L., Nierman W.C., 

RA Hazen A., Cline R.T., Redmond C, Thwaite J.E., White O., 

RA Salzberg S.L., Thomason B., Friedlander A.M. , Koehler T.M., 

RA Hanna P.C., Kolstoe A.-B., Fraser CM./ 

RT "The genome sequence of Bacillus anthracis Ames and comparison to 

RT closely related bacteria."; 

RL Nature 423:81-86(2003). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=Ames ancestor; 

RA Ravel J., Rasko D.A., Shumway M.F., Jiang L., Cer R.Z., Federova N.B., 

RA Wilson M., Stanley S., Decker S., Read T.D., Salzberg S.L., 

RA Fraser C .M. ; 

RT "Bacillus anthracis comparative genomics."; 

RL Submitted (MAY-2004) to the EMBL/GenBank/DDB J databases. 

RN [3] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=Sterne; 

RA Brettin T.S., Bruce D., Challacombe J.F., Gilna P., Han C, Hill K., 

RA Hitchcock P., Jackson P., Keim P., Longmire J., Lucas S., Okinaka R., 

RA Richardson P., Rubin E., Tice H.; 

RT "Complete genome sequence of Bacillus anthracis Sterne."; 

RL Submitted (JAN-2004) to the EMBL/GenBank/DDB J databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AE016879; AAP28356.1; -; Genomic_DNA. 

DR EMBL; AE017334; AAT33775.1; Genomic_DNA. 

DR EMBL; AE017225; AAT56616.1; -; Genomic_DNA. 

DR TIGR; BA4 653; -. 

DR TIGR; GBAA4653; 

KW Complete proteome. 

SQ SEQUENCE 176 AA; 20435 MW; B2DE122C23A63145 CRC64; 

Query Match 31.6%; Score 276.5; DB 2; Length 176; 

Best Local Similarity 42.8%; Pred. No. 3e-15; 

Matches 62; Conservative 28; Mismatches 50; Indels 5; Gaps 2; 

Qy 2 8 LMKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQVD 87 

: I I : I I I I : : I : I I : I I I I I : I I I I I : I : I : : I : I I 
Db 32 VTEKEPEVTILLERMYVDGEVSEEIFTEKVTUDLENILQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

I I I I I I I : I I I I : I : I : : III I I I I I I I : I : I I I I : : I I : 

Db 92 DISPLLKTSGYFGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERAKLKRGIRIK 150 

Qy 148 TKKEFEHVIEAVKSSGSQHHVEDMK 172 

: I : I I I : I I : I : : I 
Db 151 SKEGFVKTIEKMK QYAVQNKK 171 
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RESULT 10 
Q6HDA3 BACHK 



ID Q6HDA3_BACHK PRELIMINARY; PRT; 17 6 AA. 

AC Q6HDA3; 

DT 19-JUL-2004, integrated into UniProtKB/TrEMBL. 

DT 19-JUL-2004, sequence version 1. 

DT 07-FEB-2006, entry version 9. 

DE Probable bypass-of-f orespore protein C. 

GN Name=bofC; OrderedLocusNames=BT9727_4156; 

OS Bacillus thuringiensis subsp. konkukian. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus; 

OC Bacillus cereus group. 

OX NCBI_TaxID=180856; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=97-27; 

RA Brettin T.S., Bruce D., Challacombe J.F., Gilna P., Han C, Hill K., 

RA Hitchcock P., Jackson P., Keim P., Longmire J., Lucas S., Okinaka R., 

RA Richardson P., Rubin E., Tice H.; 

RT "Complete genome sequence of Bacillus thuringiensis 97-27."; 

RL Submitted (JUN-2004) to the EMBL/GenBank/DDBJ databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AE017355; AAT63714.1; -; GenomicJDNA. 

KW Complete proteome. 

SQ SEQUENCE 176 AA; 20412 MW; 88752B059C739EFB CRC64; 

Query Match 31.4%; Score 274.5; DB 2; Length 176; 

Best Local Similarity 42.8%; Pred. No. 4.4e-15; 

Matches 62; Conservative 27; Mismatches 51; Indels 5; Gaps 2; 

Qy 28 LAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQVD 87 

: I I : | | | | : : | : | | : I I I I I I I I I I : I : I : : I : I I 

Db 32 VTEKEPEVTILLERMYVDGEVSEEIFTEKVADLEKILQQYKEWQLVDRDDVQIVLQKKVD 91 

Qy 88 DISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYR 147 

I I I I I I I : I I I I : I : I : : III I I I I I I I : I : I I I I : : I I : 

Db 92 DISPLLKTSGYFGVSEEGILQIFKGVPK-SDNAIHSFFQIDMKKLESYERAKLKRGIRIK 150 

Qy 148 TKKEFEHVIEAVKSSGSQHHVEDMK 172 

: I : I I I : I I : I : : I 
Db 151 SKEGFVKTIEKMK QYAVQNKK 171 



RESULT 11 
Q9KDJ0_BACHD 

ID Q9KDJ0_BACHD PRELIMINARY; PRT; 178 AA. 

AC Q9KDJ0; 

DT 01-OCT-2000, integrated into UniProtKB/TrEMBL. 

DT 01-OCT-2000, sequence version 1. 

DT 07-FEB-2006, entry version 16. 

DE Forespore regulator of the sigma-K checkpoint. 

GN OrderedLocusNames=BH1223; 

OS Bacillus halodurans. 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=8 6665 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [ LARGE SCALE GENOMIC DNA] . 

RC STRAIN=C-125 / JCM 9153; 
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RX MEDLINE=20512582; PubMed=l 10581 32 ; DOI=10 . 1093/nar/28 . 21 . 4317 ; 

RA Takami H., Nakasone K. , Takaki Y., Maeno G. , Sasaki R., Masui N . , 

RA Fuji F. , Hirama C, Nakamura Y . , Ogasawara N . , Kuhara S., 

RA Horikoshi K. ; 

RT "Complete genome sequence of the alkaliphilic bacterium Bacillus 

RT halodurans and genomic sequence comparison with Bacillus subtilis."; 

RL Nucleic Acids Res. 28:4317-4331(2000). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; BA000004; BAB04942.1; -; Genomic_DNA. 

DR PIR; G83802; G83802 . 

DR BioCyc; BHAL8 6665 : BH1223-MONOMER; -. 

KW Complete proteome. 

SQ SEQUENCE 178 AA; 20241 MW; 5902515B57EA72C1 CRC64; 

Query Match 26.5%; Score 232; DB 2; Length 178; 

Best Local Similarity 32.7%; Pred. No. 1.8e-ll; 

Matches 4 9; Conservative 32; Mismatches 65; Indels 4; Gaps 3; 

Qy 9 IGLLLIGSLLAALSFHLEAIJ^-KPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAY 67 

: I I I : : I : II : : : I I : I : I I : : I I hi I 

Db 20 VNFFLISSIVAMGGNQIAVAAETNSTSISVRLERHYVDGEISEEYITEPIETIEQLMGQY 79 

Qy 68 KGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEP-IQSFFQ 12 6 

II : I : : : I I I : I I I : I I I : : : I I : : : I : I I I I I I : 

Db 80 HGWIFVSQNGDEMILKKDVPDLSPLTKANGFFSISNDGQLNLYDG — SAEEQPIIQSFFR 137 

Qy 127 IDIKRLESRMADDLRKGIPYRTKKEFEHVI 156 

: I I : I I I : I I I I I I : : I : 
Db 138 LDTKKLESGLHDQLRSGIRIGTLDDYREVL 167 



RESULT 12 
Q5WHR7_BACSK 

ID Q5WHR7_BACSK PRELIMINARY; PRT; 172 AA. 

AC Q5WHR7; 

DT 23-NOV-2004, integrated into UniProtKB/TrEMBL . 

DT 23-NOV-2004, sequence version 1. 

DT 07-FEB-2006, entry version 8. 

DE Forespore regulator of the sigma-K checkpoint. 

GN OrderedLocusNames=ABC1553; 

OS Bacillus clausii (strain KSM-K16) . 

OC Bacteria; Firmicutes; Bacillales; Bacillaceae; Bacillus. 

OX NCBI_TaxID=66692 ; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RA Takaki Y., Kageyama Y., Shimamura S., Suzuki H., Nishi S., Hatada Y., 

RA Kawai S., Ito S. # Horikoshi K. ; 

RT "The complete genome sequence of the alkaliphilic Bacillus clausii 

RT KSM-K16 . " ; 

RL Submitted (OCT-2003) to the EMBL/ GenBank/DDB J databases. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AP006627; BAD64088.1; -; Genomic_DNA. 

KW Complete proteome. 

SQ SEQUENCE 172 AA; 19409 MW; 8BBDC809CB1048A2 CRC64 ; 
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Query Match 19.1%; Score 167; DB 2; Length 172; 

Best Local Similarity 26.8%; Pred. No. 5.5e-06; 



Matches 42; Conservative 34; Mismatches 73; Indels 8; Gaps 3; 

Qy 7 FKIG LLLIGSLLAA LSFHLEALAEKPAKVQIQLEKVYLDGDVGIENKVEAART 59 

I : I 11:1: : I :| I :: I I: II:: I I 

Db 13 FS LGAAS FALL LGT F 1 1 LG EDAS N DS PQ VAN P PNAT E VLRE T V FADGN I RT E RVE E T I WA 72 

Qy 60 LEDFKAAYKGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSE 119 

: : I I I I : I I I : I I I : I I I : I : I I I : I I : I : I I : I : 
Db 73 IDDFWAEYEEWQLVDQSNHFVHFRKTILDLSPEVKEDGYLGIDADRGIVLYTGAQE-AGQ 131 

Qy 120 PIQSFFQIDIKRLESRMADDLRKGIPYRTKKEFEHVI 156 

: : : | : : | I : I I : : : : : : 

Db 132 IVTRYSEIEAAKLSESDVAALEQGIRIESLEHLQNLL 168 



RESULT 13 
Q2ZJH5 CALSA 



ID Q2ZJH5_CALSA PRELIMINARY; PRT; 190 AA. 

AC Q2ZJH5; 

DT 20-DEC-2005, integrated into UniProtKB/TrEMBL . 

DT 20-DEC-2005, sequence version 1. 

DT 07-FEB-2006, entry version 3. 

DE Hypothetical protein precursor. 

GN ORFNames=CsacDRAFT_1333 ; 

OS Caldicellulosiruptor saccharolyticus DSM 8903. 

OC Bacteria; Firmicutes; Clostridia; Clostridiales; Syntrophomonadaceae; 

OC Caldicellulosiruptor . 

OX NCBI_TaxID=351627; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=DSM 8 903; 

RG US DOE Joint Genome Institute ( JGI-PGF) ; 

RA Copeland A., Lucas S., Lapidus A., Barry K., Detter J.C., Glavina T., 

RA Hammon N., Israni S., Pitluck S., Richardson P.; 

RT "Sequencing of the draft genome and assembly of Caldicellulosiruptor 

RT saccharolyticus DSM 8903."; 

RL Submitted (OCT-2005) to the EMBL/GenBank/DDBJ databases. 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=DSM 8 903; 

RG US DOE Joint Genome Institute ( JGI-ORNL) ; 

RA Larimer F. , Land M. ; 

RT "Annotation of the draft genome assembly of Caldicellulosiruptor 

RT saccharolyticus DSM 8903."; 

RL Submitted (NOV-2005) to the EMBL/GenBank/DDBJ databases. 

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is 

CC preliminary data . 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AALW01000009 ; EAP43465.1; -; Genomic_DNA. 

KW Hypothetical protein; Signal. 

FT SIGNAL 1 34 Potential. 

SQ SEQUENCE 190 AA; 21788 MW; C3 6C01 58B923C919 CRC64; 



Query Match 12.0%; Score 105; DB 2; Length 190; 

Best Local Similarity 22.6%; Pred. No. 1.1; 
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Matches 38; Conservative 39; Mismatches 75; Indels 16; Gaps 6; 



Qy 4 RSKFKIGLLLIGSLLAALSFHLEALAEKPAKVQIQLEKVYLDGDVG--IENKVEAAR 58 

: I I I | | ::::::::: | :: | | | | | | : : 

Db 32 KSKIKSKQQLKGKNITEIAYIKDSFPQRINEDTILWRKY FKG-CGHIIEEKSNISKEFV 90 

Qy 59 --TLEDFKAAYKGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGI 116 

I MM: : II:: ::: : : :| :| :| :: |: :| : 

Db 91 NMTKEDFKSLFSGWEIDAFNSKYWISRTFEGYC SNHFIISIKDDRVAIFYSQP-V 145 



Qy 117 LSEPIQSFFQIDIKRLESRMADDLRKGIPYRTKKEFEHVTEAVKSSGS 164 

: : : I I I : : II : I M I I I : I : I I 

Db 14 6 DGDNLKLVTPI S IDDLPEKEVEDLKKGI WNS FEDAI KIVEDFGS 190 



RESULT 14 
Q33NB2_METHU 

ID Q33NB2_METHU PRELIMINARY; PRT; 682 AA. 

AC Q33NB2; 

DT 06-DEC-2005, integrated into UniProtKB/TrEMBL . 

DT 06-DEC-2005, sequence version 1. 

DT 07-FEB-2006, entry version 3. 

DE Hypothetical protein. 

GN ORFName s =MhunDRAFT_3 177; 

OS Methanospirillum hungatei JF-1. 

OC Archaea; Euryarchaeota; Methanomicrobia; Methanomicrobiales; 

OC Methanospirillaceae; Methanospirillum. 

OX NCBI_TaxID=3232 5 9; 

RN [1] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=JF-1; 

RG US DOE Joint Genome Institute ( JGI-PGF) ; 

RA Copeland A., Lucas S., Lapidus A., Barry K. , Detter J.C., Glavina T., 

RA Hammon N., Israni S., Pitluck S., Richardson P.; 

RT "Sequencing of the draft genome and assembly of Methanospirillum 

RT hungatei JF-1."; 

RL Submitted (OCT-2005) to the EMBL/GenBank/DDB J databases. 

RN [2] 

RP NUCLEOTIDE SEQUENCE. 

RC STRAIN=JF-1; 

RG US DOE Joint Genome Institute ( JGI-ORNL) ; 

RA Larimer F . , Land M. ; 

RT "Annotation of the draft genome assembly of Methanospirillum hungatei 

RT JF-1."; 

RL Submitted (NOV-2005) to the EMBL/GenBank/DDB J databases. 

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDB J whole genome shotgun (WGS) entry which is 

CC preliminary data. 

CC ' 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC • 

DR EMBL; AALU01000001 ; EAP17254.1; Genomic_DNA. 

KW Hypothetical protein. 

SQ SEQUENCE 682 AA; 79233 MW; DD76EF8BBDE3B67D CRC64; 



Query Match 11.3%; Score 99; DB 2; Length 682; 

Best Local Similarity 25.6%; Pred. No. 15; 

Matches 41; Conservative 32; Mismatches 55; Indels 32; Gaps 6; 
Qy 11 LLLIGSLLAALSFHLEALAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGW 70 
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I I I :|| :| :: II |::|| : I |:: |:: :||| I I 

Db 90 LTRISELEKSLSEYLSSIQEKEAQIQILTDLVK EHENTASQKIEDFSARIK — 140 

Qy 71 QLIDQ-KKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDI 129 

MM: : M I : I : I : : I I I : I : I : 
Db 141 QLTDQLESQKVLFEKEKSDLR — SEYEEILAKTRDEWRKEHELRTLASD 188 

Qy 130 KRLESRMADDLR KGIPYRTKKEFEHVIEAVKS 161 

I : : I : I : : I : I : I : : I : : I 

Db 189 — LKNRIAEEKRVQKDRIREQGEYTKRISELQSFLESARS 226 



RESULT 15 
TBPB_NEIMB 

ID TBPB_NEIMB STANDARD; PRT; 712 AA. 

AC Q9K0V0; 

DT ll-JAN-2001, integrated into UniProtKB/Swiss-Prot . 

DT 01-OCT-2000, sequence version 1. 

DT 07-MAR-2006, entry version 31. 

DE Transf errin-binding protein 2 precursor (TBP-2) . 

GN Name=tbpB; Synonyms=tbp2 ; OrderedLocusNames=NMB04 60 ; 

OS Neisseria meningitidis serogroup B. 

OC Bacteria; Proteobacteria ; Betaproteobacteria; Neisseriales; 

OC Neisseriaceae; Neisseria. 

OX NCBI_TaxID=4 91; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE GENOMIC DNA] . 

RC STRAIN=MC58 / Serogroup B; 

RX MEDLINE=20175755; PubMed-107 10307 ; DOI=10 . 1126/science . 287 . 5459 . 1809; 

RA Tettelin H., Saunders N.J., Heidelberg J.F., Jeffries A.C., 

RA Nelson K.E., Eisen J. A., Ketchum K.A. , Hood D.W., Peden J.F., 

RA Dodson R.J., Nelson W.C., Gwinn M.L., DeBoy R.T., Peterson J.D., 

RA Hickey E.K., Haft D.H., Salzberg S.L., White O., Fleischmann R.D., 

RA Dougherty B.A., Mason T.M., Ciecko A., Parksey D.S., Blair E., 

RA Cittone H., Clark E.B., Cotton M.D., Utterback T.R., Khouri H.M., 

RA Qin H., Vamathevan J.J., Gill J., Scarlato V., Masignani V., Pizza M., 

RA Grandi G., Sun L. # Smith H.O., Fraser CM., Moxon E.R., Rappuoli R,, 

RA Venter J.C.; 

RT "Complete genome sequence of Neisseria meningitidis serogroup B strain 

RT MC58."; 

RL Science 287:1809-1815(2000). 

CC -!- FUNCTION: Acts as a transferrin receptor and is required for 
CC transferrin utilization (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Outer membrane; lipid-anchor (Probable). 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribution-NoDerivs License 

CC 

DR EMBL; AE002098; AAF40897.1; -; Genomic_DNA. 

DR PIR; E81196; E81196. 

DR GenomeReviews; AE002 0 98_GR; NMB04 60. 

DR TIGR; NMB0460; 

DR BioCyc; NMEN4 91 : NMB 04 60 -MONOMER; -'. 

DR InterPro; IPR000437; Prok_lipoprot_S . 

DR InterPro; IPR001677; Transf errin_bd . 

DR Pfam; PF01298; Lipoprotein_5 ; 1. 

DR PROSITE; PS00013; PROKAR_LIPOPROTEIN; FALSE_NEG. 

KW Complete proteome; Lipoprotein; Membrane; Outer membrane; Palmitate; 

KW Receptor; Signal. 

FT SIGNAL 1 20 By similarity. 

FT CHAIN 21 712 Transf errin-binding protein 2. 
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FT /FTId=PRO_0000018193 . 

FT LIPID 21 21 N-palmitoyl cysteine (Probable) . 

FT LIPID 21 21 S-diacylglycerol cysteine (Probable) 

SQ SEQUENCE 712 AA; 77417 MW; 77EA248941E8EF0C CRC64; 



Query Match 11.3%; Score 98.5; DB 1; Length 712; 

Best Local Similarity 25.5%; Pred. No. 17; 

Matches 40; Conservative 21; Mismatches 55; Indels 41; 



Gaps 



7; 



Qy 

Db 

Qy 

Db 



30 EKPAKVQIQLEKVYLDGD VGIENKVEAARTLEDFKAAYKG 69 

I I : : : I I I I I II I : I : I I I I 

99 ELPKRQKSVIEKVETDSDNNIYSSPYLKPSNHQNGNTGNGINQPKNQAKDYENFKYVYSG 158 

70 WQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDI 12 9 

I I : I I : I I I I : I I I : I I : I : : : I 

159 WFYKHAKREFNL KVEPKSAKNG DDGYI- FYHGKEP — SRQLPASGKITY 204 



Qy 



Db 



130 KRLESRMADDLRKGIPYRTKKEFEHVIEAVKSSGSQH 166 

I : I I : I I : : I : I : III:: 

205 KGV-WHFATDTKKG QKFREIIQPSKSQGDRY 234 



Search completed: June 2, 2006, 20:29:51 
Job time : 14 6 sees 



SCORE 1.3 BuildDate: 12/06/2005 
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Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: June 2, 2006, 20:27:32 



Search time 22 Seconds 
(without alignments) 
756.614 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-510-408-2 
875 

1 MYSRSKFKIGLLLIGSLLAA. 



. HVIEAVKSSGSQHHVEDMKT 17 3 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



283416 seqs, 96216763 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



283416 



Post-processing : 



Database 



Minimum Match 0% 

Maximum Match 100% 

Listing first 45 summaries 

PIR 80:* 



pirl : * 
pir2 : * 
pir3 : * 
pir4 : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 


Description 


1 


486.5 


55.6 


170 


2 


A69596 


forespore sigma-K 


2 


232 


26.5 


178 


2 


G83802 


forespore sigma-K 


3 


98.5 


11.3 


712 


2 


E81196 


trans f err in-bindin 


4 


92.5 


10.6 


318 


1 


E71690 


minor teichoic aci 


5 


91 


10.4 


614 


2 


E70120 


hypothetical prote 


6 


87 


9.9 


1172 


2 


S51623 


cutl4 protein - fi 


7 


86 


9.8 


1329 


2 


AE1901 


WD-repeat containi 


8 


85.5 


9.8 


435 


2 


149327 


clathr in- associate 


9 


85.5 


9.8 


435 


2 


A31596 


clathrin coat asse 


10 


84.5 


9.7 


435 


2 


G02088 


assembly protein 5 


11 


84 


9.6 


338 


1 


DENDG 


glyceraldehyde-3-p 


12 


83.5 


9.5 


180 


2 


C89929 


conserved hypothet 


13 


83 


9.5 


328 


2 


F83599 


hypothetical prote 


14 


82 


9.4 


1281 


1 


GNMSLL 


retrovirus-related 


15 


82 


9.4 


1375 


2 


T37672 


probable DNA repai 


16 


81.5 


9.3 


242 


2 


A89852 


conserved hypothet 


17 


81.5 


9.3 


435 


2 


JC6563 


clathrin- associate 


18 


81.5 


9.3 


628 


2 


H89917 


conserved hypothet 


19 


80.5 


9.2 


620 


2 


C69278 


chorismate mutase/ 
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Q 

^ . 


1 

i X 


166 


2 


D95203 


■i qnrhnr i sTri3t*3SP "fa 

-LOlllCl La O C X Q 


22 


80 


9 t 


1 


328 


2 


B38477 


hvDothe tica 1 nrote 


23 


80 


9 t 


1 


370 


2 


A25004 


keratin. 53K tvnp 




fin 


Q 

-7 < 


1 

• X 


416 




T52497 


hvnnt - hp 1" i r*a 1 nrotp 


25 


79 . 5 


9 ( 


1 


553 


2 


F96982 


Vision threonvl-tR 


? 

o 


7 Q ^ 


Q 

-7 « 


1 

. J. 


R77 


? 


H7?^fift 


nrnl i n p — t" RMA 1 i nfl «; 




79 S 


Q 

^ « 


1 

• X 


1517 




F65112 


nlutamate svnthase 


9 ft 
^ o 


7Q ^ 


Q 
-7 « 


1 

. X 


l S^ft 


c 


7v FD4 1 ? 


fTliit"3Trip1"p s \7nt*ha *5P 


-7 


7 9 


Q 




^?ft 


9 
c. 


O / c O J u 


harul ovi rus reDeat 

XJ ci Vw- L4. _i_ v»/ v -i. ^_ \~i <j i. ^^/c a. 


•J w 


7 9 


9 t 


0 


348 


2 


S38148 


hvDothetical nrnte 


O X 


7 9 


9 t 


0 


353 


2 


A7 2 4 1 0 


chorisTnate tnutase/ 




7 9 


9 t 


0 


-j zj z> 


2 


T48039 


hvnothpti ra 1 nrot"p 


33 


7 9 


9 . 


, 0 


934 


2 


H88391 


orotein R06B10 2 T 


^4 


7ft R 


q 


n 


£. Z> -J 


9 




yxyuciaiuciiyuc -J ^> 




7ft S 


q 

-7 . 


n 

, \j 


JU1 


? 


R904 1 4 


H vriot* h p t* S c a 1 riT*ot*p 


36 


78 . 5 


9 , 


. 0 


722 


2 


A82617 


alvcvl- tRNA svnthe 


^7 


7ft S 


q 


0 


1486 


2 


AT0906 


nlutamate svnthase 


38 


78 


8 . 


9 


338 


2 


T06781 


alvcera ldehvde — 3-D 


o .7 


7 ft 


Q 

o . 


q 




9 


1 V O D -7 


h \/Ti nthpt *i ra 1 nrnfp 


40 


77 . 5 


8 , 


, 9 


297 


2 


E86828 


hypothetical prote 


41 


77.5 


8. 


.9 


390 


2 


S25787 


hypothetical prote 


42 


77 


8. 


.8 


259 


2 


H64211 


guanylate kinase ( 


43 


77 


8. 


.8 


281 


2 


F69323 


ATP phosphoribosyl 


44 


77 


8. 


,8 


390 


2 


D82922 


conserved hypothet 


45 


77 


8. 


.8 


461 


2 


B82155 


deoxyribodipyrimid 



ALIGNMENTS 
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GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 
Run on: June 2, 2006, 20:27:43 



Search time 183 Seconds 
(without alignments) 
437.903 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-510-408-2 
875 

1 MYSRSKFKIGLLLIGSLLAA. 



. HVIEAVKSSGSQHHVEDMKT 173 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 2097797 seqs, 463214858 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : 



2097797 



Published_Applications_AA_Main : * 

1 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US07_PUBCOMB.pep: * 

2 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US08_PUBCOMB.pep: * 

3 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US09_PUBCOMB.pep: * 

4 : /EMC_CelerraJ3IDS3/ptodata/2/pubpaa/US10A_PUBCOMB.pep: * 

5 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa/US10B_PUBCOMB.pep: * 

6: /EMC_Celerra_SIDS3/ptodata/2/pubpaa/USll_PUBCOMB.pep:* 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 








Description 


1 


875 


100 


.0 


173 


5 


US- 


10- 


510 


-408-2 


Sequence 


2, Appli 


2 


875 


100 


.0 


173 


5 


US- 


10- 


510 


-386-124 


Sequence 


124, App 


3 


98.5 


11 


.3 


712 


4 


US- 


10- 


240 


-218-8 


Sequence 


8, Appli 


4 


98.5 


11 


.3 


712 


4 


US- 


10- 


320 


-800-73 


Sequence 


73, Appl 


5 


98.5 


11 


.3 


712 


4 


US- 


10- 


220 


-481-85 


Sequence 


85, Appl 


6 


91 


10 


.4 


592 


5 


US- 


10- 


994 


-726-346 


Sequence 


34 6, App 


7 


91 


10 


.4 


612 


5 


us- 


10- 


994 


-726-345 


Sequence 


34 5, App 


8 


89.5 


10 


.2 


435 


4 


us- 


10- 


408 


-765A-1973 


Sequence 


1973, Ap 


9 


88.5 


10 


.1 


845 


4 


US- 


10- 


437 


-963-186548 


Sequence 


186548, 


10 


87.5 


10 


.0 


431 


4 


us- 


10- 


408 


-765A-2629 


Sequence 


2629, Ap 


11 


87 


9 


.9 


1172 


4 


us- 


10- 


369 


-493-2552 


Sequence 


2552, Ap 


12 


86 


9 


.8 


1533 


4 


us- 


10- 


437 


-963-128345 


Sequence 


128345, 


13 


85.5 


9 


.8 


435 


5 


us- 


10- 


367 


-057-61 


Sequence 


61, Appl 


14 


84 


9 


.6 


338 


6 


us- 


11- 


188 


-298-2895 


Sequence 


2895, Ap 


15 


84 


9 


.6 


980 


4 


us- 


10- 


437 


-963-128383 


Sequence 


128383, 


16 


82.5 


9 


.4 


256 


4 


us- 


10- 


424 


-599-167485 


Sequence 


167485, 
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17 


ft? 


g 


. 4 


1375 


5 


US- 


10- 


732 


-923-8606 


Seauence 


8 606 An 


1 R 

X O 


ft 1 s 

O x . J 


q 


-i 
. j 


61 ft 

O X o 


j 


US- 

VJ <J 


09- 

\J m* 


815 

O X J 




kJCVjUCHL-C 


10864 A 


19 


81.5 


9 


. 3 


628 


3 


us- 


09- 


815 


-242-12616 


Sequence 


12616 A 


20 


81.5 


9 


. 3 


628 


4 


us- 


10- 


282 


-122A-44243 


Sequence 


44243, A 


9 1 

£. X 


ft! S 

OX > J 


Q 

-7 


. 3 


640 


3 


US- 


09- 


815 


-242-5296 


Seauence 

fcJ U W 


529 6. Ad 


9 9 


ft! S 

o X . J 


q 


• o 


770 


4 


US - 


10- 


282 


-122A-S7088 

X^.^./A J / WOO 


feculence 


57088. A 


9 3 


80 


9 


B 1 


166 


4 


US- 


10- 


282 


-122A-73643 


Seauence 


73643, A 


2 4 


RO 


q 


( l 


166 


5 


US - 


10- 


472 


-928-3600 


Seauence 


3600, Ap 


25 


80 


9 


, 1 


166 


5 


us- 


10- 


617 


-320-3096 


Seauence 


3096, Ap 


9 6 

o 


7 9 S 


q 


1 

. X 


61 0 

OX VJ 


5 
j 


us- 

V/O 


1 0- 

X v 


4 SO 


-763-1671 1 




36711, A 

<tJ V f J> J> f * x 


27 


79 S 
/ . j 


q 




906 


5 


us- 


10- 


450 


-763-48680 


Seauence 


48680, A 


28 


79.5 


9 


. 1 


144 9 


5 


US- 


10- 


450 


-763-60431 


Sequence 


60431, A 


2 9 


79 S 

/ ^ . J 


9 




1517 


3 


US- 


09- 


815 


-242-10325 


Sequence 


10325, A 


30 


79 S 
f ^ . j 


9 


. 1 


1517 


4 


US- 


10- 


282 


-122A-56712 


Seauence 


56712, A 


j X 


7 9 S 


q 


1 

> X 


1 Sift 

X J o o 


4 




1 0- 
x vj 


?ft? 


-1 22A-78229 


S eauence 

i»J V^ \J UCll^C 


78229 A 

f VJ Cm, Cm ■mf f J*. 


19 


7 9 


q 


. 0 


227 3 

C, C~ 1 J 


4 


us - 


10- 


425 


-115-314032 

XXJ J X T V O L~ 


•J V- VJ UV.11V> V 


314032 


33 


7R S 

/O.J 


9 


. 0 


278 


5 


US- 


10- 


450 


-763-58916 


Seau ence 


58916. A 


^4 


7 ft S 

/O.J 


q 




29S 


g 


US- 


11- 


188 


-298-1 2304 

mm Zt fj 1L Jul 


Seauence 


12304. A 


3 S 


7R S 

/O.J 


q 




1 4fi 6 

X T O O 


3 


US- 


09- 


815 


-242-13728 


Seauence 

%J V^ VJ VJ. V^ 1 w 


13728, A 


^ 6 


7 R S 

/O.J 


q 


. VJ 


I486 


4 




i n- 

X u 


?R? 


-1 22A-7 S0S7 

l.CmC.t\. 1 JV J / 


C VJ UCll^ C 


75057 A 


^7 


7R S 

/O.J 


9 


. 0 


1486 


4 


US- 


10- 


282 


-122A-75654 


Seauence 

*-J W VJ VA W llv W 


75654, A 


o o 


7R 

/ O 


8 


. 9 


338 




US- 


11- 


188 


-298-1497 


Seauence 

V-* VJ VJ W ilv V- 


1497, Ap 




7 R 


q 
o 


q 


4 9S 

*a y J 


4 




i n- 

X u 


494 

*3 £. *a 


_ c q q _ 9 t o 7 q -3 




232793, 




77 S 
/ / . j 


ft 


9 


322 


4 


US- 


10- 


369 


-493-13946 


Seauence 


13946, A 


41 


77.5 


8 


. 9 


434 


4 


US- 


10- 


425 


-114-54183 


Sequence 


54183, A 


42 


77.5 


8 


.9 


434 


4 


US- 


10- 


425 


-115-291120 


Sequence 


291120, 


43 


77.5 


8 


.9 


536 


3 


US- 


09- 


738 


-626-6061 


Sequence 


6061, Ap 


44 


77.5 


8 


.9 


876 


4 


US- 


10- 


437 


-963-168060 


Sequence 


168060, 


45 


77.5 


8 


.9 


1639 


4 


US- 


10- 


437 


-963-128536 


Sequence 


128536, 
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GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 

Run on: June 2, 2006, 20:28:58 / Search time 14 Seconds 

(without alignments) 
142.912 Million cell updates/sec 



Title: US-10-510-408-2 
Perfect score: 875 

Sequence: 1 MYSRSKFKIGLLLIGSLLAA. 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 



HVIEAVKSSGSQHHVEDMKT 17 3 



Searched: 58871 seqs, 11565156 residues 

Total number of hits satisfying chosen parameters: 58871 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : Published_Applications_AA_New: * 

1 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

2 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

3 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

4 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

5 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

6 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

7 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 

8 : /EMC_Celerra_SIDS3/ptodata/2/pubpaa 



/US 0 9_NEW_PUB . pep : * 
/US 0 6_NEW_PUB . pep : * 
/US 07_NEW_PUB . pep : * 
/US08_NEW_PUB . pep : * 
/PCT_NEW_PUB . pep : * 
/US 1 0_NEW_PUB . pep : * 
/US 1 1_NEW_PUB . pep : * 
/US 60_NEW_PUB . pep : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


77 


8 


.8 


426 


6 


US- 


10- 


953-349-16804 


Sequence 


16804, A 


2 


77 


8 


.8 


431 


6 


us- 


10- 


953-349-16803 


Sequence 


16803, A 


3 


75.5 


8 


.6 


887 


6 


us- 


10- 


527-411-161 


Sequence 


161, App 


4 


74.5 


8 


.5 


502 


6 


us- 


10- 


953-349-3682 


Sequence 


3682, Ap 


5 


74 


8 


.5 


312 


6 


us- 


10- 


953-349-23648 


Sequence 


23648, A 


6 


74 


8 


.5 


641 


7 


us- 


11- 


249-111-76 


Sequence 


7 6, Appl 


7 


73.5 


8 


.4 


348 


6 


us- 


10- 


953-349-6565 


Sequence 


6565, Ap 


8 


73.5 


8 


.4 


382 


6 


us- 


10- 


953-349-6564 


Sequence 


6564, Ap 


9 


73.5 


8 


.4 


386 


6 


us- 


10- 


953-349-6563 


Sequence 


6563, Ap 


10 


73 


8 


.3 


294 


6 


us- 


10- 


953-349-8740 


Sequence 


8740, Ap 


11 


73 


8 


.3 


338 


6 


us- 


10- 


953-349-8739 


Sequence 


8739, Ap 


12 


73 


8 


.3 


364 


6 


us- 


10- 


953-349-8738 


Sequence 


8738, Ap 


13 


72.5 


8 


.3 


259 


6 


us- 


10- 


953-349-16805 


Sequence 


16805, A 


14 


72.5 


8 


.3 


2871 


6 


us- 


10- 


505-928-100 


Sequence 


100, App 
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15 


71.5 


8 


. 2 


296 


7 


US- 


11- 


246- 


957 


-15 


Sequence 


15, AddI 


16 


71.5 


8 


. 2 


1013 


6 


us- 


10- 


527- 


411 


-18 


Sequence 


18, Appl 


17 


71 


8 


. l 


311 


6 


us- 


10- 


953- 


349 


-21965 


Sequence 


21965, A 


18 


71 


8 


. 1 


321 


6 


us- 


10- 


953- 


349 


-21964 


Sequence 


21964, A 


19 


71 


8 


. 1 


382 


6 


us- 


10- 


953- 


349 


-3683 


Sequence 


3683, Ap 


20 


71 


8 


. 1 


708 


7 


us- 


11- 


293- 


697 


-4329 


Sequence 


4329, Ap 


21 


70 


8 


. 0 


291 


6 


us- 


10- 


953- 


349 


-8115 


Sequence 


8115, Ap 


22 


70 


8 


. 0 


294 


6 


us- 


10- 


953- 


349 


-8114 


Sequence 


8114, Ap 


23 


70 


8 


. 0 


338 




us- 


10- 


953- 


349 


-8113 


Sequence 


8113, Ap 


24 


70 


8 


. 0 


787 


7 


us- 


11- 


293- 


697 


-3773 


Sequence 


3773, Ap 


25 


69 . 5 


7 


. 9 


398 


7 


us- 


11- 


219- 


635 


-18 


Sequence 


18, AddI 


2 6 


69 . 5 


7 


. 9 


673 


7 


us- 


11- 


219- 


635 


-2 


Sequence 


2, Appli 


27 


69 . 5 


7 


. 9 


770 


7 


us- 


11- 


219- 


635 


-26 


Sequence 


2 6, Appl 


28 


69 


7 


, 9 


323 


6 


us- 


10- 


953- 


349 


-11054 


Sequence 


11054, A 




69 


7 


. 9 


341 


6 


us- 


10- 


953- 


349 


-11053 


Sequence 


11053, A 


30 


69 


7 


. 9 


471 




us- 


10- 


953- 


349 


-11052 


Sequence 


11052, A 


O J. 


68 . 5 


7 


. 8 


464 




us- 


10- 


953- 


349 


-4418 


Sequence 


4 418, Ap 


32 


68 . 5 


7 


. 8 


469 


6 


us- 


10- 


953- 


349 


-4417 


Sequence 


4 417, Ap 


33 


68 . 5 


7 


. 8 


504 


6 


us- 


10- 


953- 


349 


-4416 


Sequence 


4416, Ap 


34 


68 


7 


. 8 


211 




us- 


10- 


953- 


349 


-4200 


Sequence 


4200, Ap 


35 


68 


7 


. 8 


246 


6 


us- 


10- 


953- 


349 


-4199 


Sequence 


4199, Ap 


36 


68 


7 


. 8 


272 


6 


us- 


10- 


953- 


349 


-4198 


Sequence 


4198, Ap 


37 


68 


7 


. 8 


442 




us- 


10- 


505- 


928 


-525 


Sequence 


525, Add 


o o 


67 . 5 


7 


. 7 


879 




us- 


10- 


527- 


411 


-159 


Sequence 


159, Add 


39 


67 


7 


, 7 


296 


7 


TJS- 


11- 


246- 


957 


-20 


Sequence 


20, Appl 


40 


67 


7 


. 7 


863 


6 


us- 


10- 


505- 


928 


-158 


Sequence 


158, App 


41 


66.5 


7 


.6 


216 


6 


us- 


10- 


953- 


349 


-28992 


Sequence 


28992, A 


42 


66.5 


7 


.6 


305 


6 


us- 


10- 


953- 


349 


-23286 


Sequence 


23286, A 


43 


66 


7 


.5 


1085 


6 


us- 


10- 


•505- 


928 


-175 


Sequence 


175, App 


44 


65.5 


7 


.5 


419 


6 


us- 


10- 


953- 


349 


-21528 


Sequence 


21528, A 


45 


65.5 


7 


.5 


467 


6 


us- 


10- 


953- 


349 


-21527 


Sequence 


21527, A 



ALIGNMENTS 
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SCORE Search Results Details for Applicatioi 
10510408 and Search Result us-10-510-408-2, 



Score Home Page Retrieve Application List SCORE System Overview SCORE FAQ Comments / Sugg 

This page gives you Search Results detail for the Application 10510408 and Search Result us-10-5; 

2.rai. 

start 

Go Back to p 

GenCore version 5.1.9 
Copyright (c) 1993 - 2006 Biocceleration Ltd. 

OM protein - protein search, using sw model 

Run on: June 2, 2006, 20:27:41 ; Search time 49 Seconds 

(without alignments) 
309.037 Million cell updates/sec 

Title : US-10-510-408-2 
Perfect score: 875 

Sequence: 1 MYSRSKFKIGLLLIGSLLAA HVIEAVKSSGSQHHVEDMKT 173 

Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 650591 seqs, 87530628 residues 

Total number of hits satisfying chosen parameters: 650591 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : Issued Patents AA: * 



/EMC_Celerra_SIDS3/ptodata/2/iaa/5_COMB.pep:* 
/EMC_Celerra_SIDS3/ptodata/2/iaa/6_COMB.pep:* 
/EMC_Celerra_SIDS3/ptodata/2/iaa/7_COMB.pep:* 
/EMC_Celerra_SIDS3/ptodata/2/iaa/H_COMB.pep: * 
/EMC_Celerra_SIDS3/ptodata/2/iaa/PCTUS_COMB.pep: 
/EMC_Celerra_SIDS3/ptodata/2/iaa/RE_COMB.pep: * 
/•EMC_Celerra_SIDS3/ptodata/2/iaa/backf ilesl .pep: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 

1 94 10.7 298 2 US-09-328-352-6064 Sequence 6064, Ap 

2 91 10.4 592 2 US-09-830-230A-346 Sequence 346, App 
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3 


91 


10 


. 4 


612 


2 


US- 


09- 


830 


-230A-345 


Sequence 


345, App 


4 


85 . 5 


9 


. 8 


435 


2 


us- 


09- 


538 


-092-963 


Sequence 


963, App 


5 


84 . 5 


9 


. 7 


A *3 C 

435 


2 


US- 


09- 


664 


-958-12 


Sequence 


12/ Appl 


6 


83 


9 


. 5 


354 


2 


us- 


09- 


252 


-991A-27916 


Sequence 


27916, A 


7 


80 


9 


. 1 


166 


2 


US- 


09- 


583 


-110-3094 


Sequence 


3094, Ap 


8 


80 


9 


. 1 


166 


2 


us- 


09- 


107 


-433-3096 


Sequence 


3096, Ap 


9 


7 9 


9 


. 0 


348 


2 


US- 


09- 


315 


-794-42 


Sequence 


42, Appl 


10 


79 


9 


. 0 


348 


2 


us- 


09- 


389 


-341-42 


Sequence 


42, Appl 


11 


77 . 5 


8 


. 9 


510 


2 


us- 


09- 


605 


-703B-1144 


Sequence 


114 4, Ap 


12 


77 . 5 


8 


. 9 


534 


2 


us- 


09- 


605 


-703B-1142 


Sequence 


1142, Ap 


13 


76 . 5 


8 


. 7 


155 


2 


us- 


09- 


732 


-210-1631 


Sequence 


1631, Ap 


14 


76. 5 


8 


. 7 


463 


2 


us- 


09- 


134 


-000C-3598 


Sequence 


3598, Ap 


15 


76. 5 


8 


. 7 


732 


5 


PCT 


-US95- 


17026-2 


Sequence 


2, Appli 


16 


76 


8 


. 7 


222 


2 


US- 


09- 


248 


-796A-14267 


Sequence 


14267, A 


17 


76 


8 


. 7 


2954 


2 


US - 


09- 


150 


-867-1 


Sequence 


1, Appli 


18 


76 


8 


. 7 


2954 


3 


us- 


09- 


724 


-584-1 


Sequence 


1, Appli 


19 


75 . 5 


8 


. 6 


341 


2 


us- 


09- 


252 


-991A-21448 


Sequence 


21448, A 


20 


75 . 5 


8 


. 6 


463 


2 


us- 


08- 


792 


-295-1 


Sequence 


1, Appli 


21 


75 . 5 


8 


. 6 


463 


2 


us- 


09- 


076 


-432-1 


Sequence 


1, Appli 


22 


75 . 5 


8 


. 6 


523 


2 


us- 


10- 


104 


-047-3546 


Sequence 


3546, Ap 


23 


75 


8 


. 6 


2183 


2 


us- 


08- 


746 


-111-5 


Sequence 


5, Appli 


24 


74 . 5 


8 


. 5 


343 


2 


us- 


09- 


807 


-258-24 


Sequence 


2 4 , Appl 


25 


74 . 5 


8 


. 5 


427 


2 


us- 


09- 


591 


-447D-2 


Sequence 


2 , Appli 


26 


74 . 5 


8 


. 5 


467 


2 


us- 


09- 


134 


-OOlC-3579 


Sequence 


357 9, Ap 


27 


74 


8 


. 5 


202 


2 


us- 


09- 


134 


-OOlC-3860 


Sequence 


38 60, Ap 


28 


74 


8 


. 5 


307 


2 


us- 


09- 


023 


-905A-36 


Sequence 


36, Appl 


29 


74 


8 


. 5 


359 


2 


us- 


09- 


710 


-279-2454 


Sequence 


2454, Ap 


30 


74 


8 


.5 


378 


2 


us- 


09- 


134 


-001C-4464 


Sequence 


4 4 64, Ap 


31 


74 


8 


. 5 


641 


2 


us- 


09- 


724 


-623-76 


Sequence 


7 6, Appl 


32 


74 


8 


. 5 


641 


3 


us- 


10- 


288 


-930-76 


Sequence 


7 6, Appl 


33 


73 


8 


. 3 


234 


2 


us- 


09- 


107 


-532A-6670 


Sequence 


6670, Ap 


34 


73 


8 


. 3 


316 


2 


us- 


09- 


107 


-532A-4924 


Sequence 


4924, Ap 


35 


73 


8 


.3 


598 


2 


us- 


09- 


512 


-563C-8 


Sequence 


8, Appli 


36 


73 


8 


. 3 


598 


2 


us- 


09- 


512 


-563C-12 


Sequence 


12, Appl 


37 


73 


8 


. 3 


598 


2 


us- 


10- 


060 


-607-8 


Sequence 


8, Appli 


38 


73 


8 


. 3 


598 


2 


us- 


10- 


060 


-607-12 


Sequence 


12, Appl 


39 


73 


8 


. 3 


598 


2 


us- 


10- 


104 


-047-2716 


Sequence 


2716, Ap 


40 


73 


8 


. 3 


653 


2 


us- 


09- 


540 


-236-3128 


Sequence 


3128, Ap 


41 


72.5 


8 


.3 


433 


2 


us- 


09- 


809 


-665A-161 


Sequence 


161, App 


42 


72.5 


8 


.3 


2871 


2 


us- 


09- 


538 


-092-936 


Sequence 


936, App 


43 


72.5 


8 


.3 


3542 


2 


us- 


10- 


087 


-013-2 


Sequence 


2, Appli 


44 


72 


8 


.2 


301 


2 


us- 


09- 


107 


-532A-4934 


Sequence 


4934, Ap 


45 


72 


8 


.2 


377 


2 


us- 


09- 


489 


-039A-12546 


Sequence 


12546, A 



ALIGNMENTS 



RESULT 1 

US-09-328-352-6064 

; Sequence 6064, Application US/09328352 

; Patent No. 6562 958 

; GENERAL INFORMATION: 

; APPLICANT: Gary L. Breton et al. 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO ACINETOBACTER 
/ TITLE OF INVENTION: BAUMANNII FOR DIAGNOSTICS AND THERAPEUTICS 
/ FILE REFERENCE: GTC99-03PA 

; CURRENT APPLICATION NUMBER: US/09/328,352 
/ CURRENT FILING DATE: 1999-06-04 
; NUMBER OF SEQ ID NOS : 8252 

; SEQ ID NO 6064 
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LENGTH: 298 
TYPE: PRT 

ORGANISM: Acinetobacter baumannii 
US-09-328-352-6064 

Query Match 10.7%; Score 94; DB 2; Length 298; 

Best Local Similarity 22.4%; Pred. No. 0.024; 

Matches 39; Conservative 28; Mismatches 55; Indels 52; Gaps 6; 

Qy 34 KVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQVDD 1 89 

|: :|:|: :|| I :: |:: : :|| : :: I : I I 

Db 50 KLDLQVEEKGIDGPVTRVDRYLEQLTIDTLRKSYKNHSFLGEEFGLQEGKGHDADWCWVI 109 

Qy 90 SPLSKTNGYI GVTEDGVI STFHGRPGILSEPIQSFF 125 

I I I : I 111:111 I II:::: 
Db 110 DPLDGTQNFINGFPHFCISIAVQHKGVTQHGVIYDPVRDELFSASRGRGAVMNQ 163 

Qy 126 QIDIKRLESRMADDLRK GI PYRTKKE FEHV IEAVKSSGSQ 165 

: I : : I I I I I I I : I I : I I : I : I 

Db 164 RRI RVNVKDS LENT FLAVGH P YRAKRAGE I VS YAEQH FAS LLAVTQAGAQ 213 



RESULT 2 

US-09-830-230A-346 

; Sequence 34 6, Application US/09830230A 
; Patent No. 6902893 
; GENERAL INFORMATION: 

; APPLICANT: Human Genome Sciences, Inc. 

; TITLE OF INVENTION: Lyme Disease Vaccines 

; FILE REFERENCE: PB481US 

; CURRENT APPLICATION NUMBER: US/09/8 30, 230A 

; CURRENT FILING DATE: 2001-09-27 

; PRIOR APPLICATION NUMBER: PCT/US98/ 12718 

PRIOR FILING DATE: 1998-06-18 
; PRIOR APPLICATION NUMBER: 60/057,483 

PRIOR FILING DATE: 1997-09-03 
; PRIOR APPLICATION NUMBER: 60/053,344 
; PRIOR FILING DATE: 1997-07-22 
; PRIOR APPLICATION NUMBER: 60/053,377 
; PRIOR FILING DATE: 1997-07-22 
; PRIOR APPLICATION NUMBER: 60/050,359 
; PRIOR FILING DATE: 1997-06-20 
; NUMBER OF SEQ ID NOS : 756 
; SOFTWARE: Patentln Ver . 2.0 
; SEQ ID NO 34 6 
LENGTH: 592 
TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-830-230A-346 

Query Match 10.4%; Score 91; DB 2; Length 592; 

Best Local Similarity 28.5%; Pred. No. 0.15; 

Matches 57; Conservative 16; Mismatches 83; Indels 44; Gaps 9; 

Qy 2 YSRSKFK IGLLLIG SLLAALSF-HLEALAEKPAKVQIQLEKVYLDGDVGIENKV 54 

: [ : : I II : I I : I : I I : I I III II II 

Db 105 FSQGDYKRIAIGTAIHGIYLSVNGAISFKNLNRLIP QIYLGAGYYDIISAIEFSK 159 

Qy 55 EAARTLEDFKAAYKGWQLIDQKKGFIL FRKQVDDISPLSKTNGYIGVTEDGVIST 109 

I I I I I I I I I I I : I I : I I I I I : : I 

Db 160 EETNNLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKN VEKILVRT 214 
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Qy 110 FHGR PGILSEPIQSFFQIDIKRLESRMADDLRKGIPY RTKKEF 152 

: I I I I I I : II III III 

Db 215 YDNHFYSYINGQWVFIGKLSLQDQDFFE KSQRMQLAKNKGSIYLTAYTLRNKKAV 2 69 

Qy 153 EHVIEAVKSSGSQHHVEDMK 172 

: : : I I I III 
Db 270 DERFKFIKDSGMNAWIDFK 289 



RESULT 3 

US-09-830-230A-345 

Sequence 345, Application US/09830230A 
Patent No. 6902893 
GENERAL INFORMATION: 
APPLICANT: Human Genome Sciences, Inc. 
TITLE OF INVENTION: Lyme Disease Vaccines 
FILE REFERENCE: PB481US 

CURRENT APPLICATION NUMBER: US/09/830, 230A 
CURRENT FILING DATE: 2001-09-27 
PRIOR APPLICATION NUMBER: PCT/US98/12718 
PRIOR FILING DATE: 1998-06-18 
PRIOR APPLICATION NUMBER : 60/057,483 
PRIOR FILING DATE: 1997-09-03 
PRIOR APPLICATION NUMBER: 60/053,344 
PRIOR FILING DATE: 1997-07-22 
PRIOR APPLICATION NUMBER: 60/053,377 
PRIOR FILING DATE: 1997-07-22 
PRIOR APPLICATION NUMBER: 60/050,359 
PRIOR FILING DATE: 1997-06-20 
NUMBER OF SEQ ID NOS : 756 
SOFTWARE: Patentln Ver . 2.0 
SEQ ID NO 345 
LENGTH: 612 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-830-230A-345 

Query Match 10.4%; Score 91; DB 2; Length 612; 

Best Local Similarity 28.5%; Pred. No. 0.15; 

Matches 57; Conservative 16; Mismatches 83; Indels 44; Gaps 9; 

Qy 2 YSRSKFK IGLLLIG SLLAALSF-HLEALAEKPAKVQIQLEKVYLDGDVGIENKV 54 

: I : : I I I : I I : I : I I : I I III II II 

Db 125 FSQGDYKRIAIGTAIHGIYLSVNGAISFKNLNRLIP QIYLGAGYYDIISAIEFSK 179 

Qy 55 EAARTLEDFKAAYKGWQLIDQKKGFIL FRKQVDD I S PLS KTNGY I GVT EDGVI S T 109 

I I I I I I I I I I I : I I : I I I I I : : I 

Db 180 EETNNLYFSSGVYGDIFLISQKSGFIKKISFPFKKQIIRILDLSSKN VEKILVRT 234 

Qy 110 FHGR PGILSEPIQSFFQIDIKRLESRMADDLRKGIPY RTKKEF 152 

: I I I I I I : II Ml III 

Db 235 YDNHFYSYINGQWVFIGKLSLQDQDFFE KSQRMQLAKNKGSIYLTAYTLRNKKAV 289 

Qy 153 EHVIEAVKSSGSQHHVEDMK 172 

: : : I I I III 
Db 2 90 DERFKFIKDSGMNAWIDFK 309 



RESULT 4 

US-09-538-092-963 
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Sequence 963, Application US/09538092 
Patent No. 6753314 
GENERAL INFORMATION: 
APPLICANT: Giot, Loic 
APPLICANT: Mansfield, Traci A. 

TITLE OF INVENTION: Protein- Protein Complexes and Method of Using Same 
FILE REFERENCE: 15966-542 

CURRENT APPLICATION NUMBER: US/09/538,092 
CURRENT FILING DATE: 2000-03-29 
PRIOR APPLICATION NUMBER: 60/127,352 
PRIOR FILING DATE: 1999-04-01 
PRIOR APPLICATION NUMBER: 60/178,965 
PRIOR FILING DATE: 2000-02-01 
NUMBER OF SEQ ID NOS : 1387 
SOFTWARE: CuraPatSeqFormatter Version 0.9 
SEQ ID NO 963 
LENGTH: 435 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE : 

NAME/KEY : misc_f eature 
LOCATION: (0) ... (0) 

OTHER INFORMATION: Polypeptide Accession Number P20172 
US-09-538-092-963 

Query Match 9.8%; Score 85.5; DB 2; Length 435; 

Best Local Similarity 28.2%; Pred. No. 0.4; 

Matches 35; Conservative 20; Mismatches 42; Indels 27; Gaps 6; 

Qy 51 ENKVEAARTLE DFKAAYKGWQLIDQ— KKGFILFRKQVDDISPLSKTNGYIGVT 102 

: I I I I I I I I I : : : : I I : I : : I : I II : 

Db 70 KQNVNAAMVFEFLYKMCDVMAAYFG-KISEENIKNNFVLIYELLDEILDF GYPQNS 124 

Qy 103 EDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDL RKGIPYRTKKEFEHV 155 

I I : I I : I I I I I : I : : : I : I I I I : I I 

Db 125 ETGALKTFITQQGIKS QHQTKEEQSQITSQVTGQIGWRREGIKYRRNELFLDV 177 

Qy 156 IEAV 159 

: I : I 

Db 178 LESV 181 



RESULT 5 

US-09-664-958-12 

; Sequence 12, Application US/09664958 
; Patent No. 6916912 
; GENERAL INFORMATION: 
; APPLICANT: Trakht, Ilya 
; APPLICANT: Canfield, Robert 
; APPLICANT: Kalantarov, Gary 
APPLICANT: Rudchenko, Sergei 

TITLE OF INVENTION: No. 6916912el Tumor-Associaed Marker 
; FILE REFERENCE: 0575/60240 

; CURRENT APPLICATION NUMBER: US/09/664,958 
; CURRENT FILING DATE: 2000-09-18 
; NUMBER OF SEQ ID NOS: 28 

SOFTWARE: Patentln version 3.1 
; SEQ ID NO 12 

LENGTH: 4 35 

TYPE: PRT 

ORGANISM: Human 
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US-09-664-958-12 

Query Match 9.7%; Score 84.5; DB 2; Length 435; 

Best Local Similarity 28.2%; Pred. No. 0.52; 

Matches 35; Conservative 20; Mismatches 42; Indels 27; Gaps 6; 

Qy 51 ENKVEAARTLE DFKAAYKGWQLIDQ--KKGFILFRKQVDDISPLSKTNGYIGVT 102 

: I I I I I I I I I : : : : I I : I : : I : I II : 

Db . 70 KQNWAAMVFEFLYKMCDVMAAYFG-KISEENIKNNFLLIYELLDEILDF GYPQNS 124 

Qy 103 EDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDL RKGIPYRTKKEFEHV 155 

I I : I I : I I I I I : I : : : I : I I I I : I I 

Db 125 ETGALKT FITQQGIKS QHQTKEEQSQITSQVTGQIGWRREGIKYRRNELFLDV 177 

Qy 156 IEAV 159 

: I : I 

Db 178 LESV 181 



RESULT 6 

US-09-252-991A-27916 

; Sequence 27916, Application US/09252991A 
; Patent No. 6551795 
; GENERAL INFORMATION: 

; APPLICANT: Marc J. Rubenfield et al. 

; TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO PSEUDOMONAS 
; TITLE OF INVENTION: AERUGINOSA FOR DIAGNOSTICS AND THERAPEUTICS 
; FILE REFERENCE: 107196.136 

; CURRENT APPLICATION NUMBER: US/09/2 52 , 9 91A 
; CURRENT FILING DATE: 1999-02-18 

PRIOR APPLICATION NUMBER: US 60/074,788 
; PRIOR FILING DATE: 1998-02-18 
; PRIOR APPLICATION NUMBER: US 60/094,190 
; PRIOR FILING DATE: 1998-07-27 
; NUMBER OF SEQ ID NOS : 33142 
; SEQ ID NO 27916 

LENGTH: 354 

TYPE: PRT 

; ORGANISM: Pseudomonas aeruginosa 
US-09-252-991A-27 916 

Query Match 9.5%; Score 83; DB 2; Length 354; 

Best Local Similarity 32.1%; Pred. No. 0.57; 

Matches 26; Conservative 11; Mismatches 28; Indels 16; Gaps 2; 

Qy 2 6 EALAEKPAKVQIQLEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQKKGFILFRKQ 85 

: I I : I I I I : I I I I I :: I I I I I I I I : I : I : : I : 

Db 112 KAAAAGMVRAQIQLAMLYRDGDGGPQDKTEAARWFR--KAAEQGDAAAQNEMGVLYWRGE 169 

Qy 8 6 VDDISPLSKTNGYIGVTEDGV 10 6 

I I : I I 

Db 170 GVDQDRV 17 6 



RESULT 7 

US-09-583-110-3094 

; Sequence 3094, Application US/09583110 
; Patent No. 6699703 
; GENERAL INFORMATION: 

; APPLICANT: Lynn Doucette-Stamm et al. 

TITLE OF INVENTION: Nucleic Acid and Amino Acid Sequences Relating to Streptococcu 
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; TITLE OF INVENTION: Pneumoniae for Diagnostics and Therapeutics 

; FILE REFERENCE: PATH00-07A 

; CURRENT APPLICATION NUMBER: US/09/583,110 

; CURRENT FILING DATE: 2000-05-26 

; PRIOR APPLICATION NUMBER: US 09/107,433 

; PRIOR FILING DATE: 1998-06-30 

; PRIOR APPLICATION NUMBER: US 60/085,131 

/ PRIOR FILING DATE: 1998-05-12 

/ PRIOR APPLICATION NUMBER: US 60/051,553 

; PRIOR FILING DATE: 1997-07-02 

/ NUMBER OF SEQ ID NOS : 5322 

/ SEQ ID NO 3094 

LENGTH: 166 

TYPE: PRT 

ORGANISM: Streptococcus pneumoniae 
US-09-583-110-3094 



Query Match 9.1%; Score 80; DB 2; Length 166; 

Best Local Similarity 24.7%; Pred. No. 0.42; 

Matches 45; Conservative 27; Mismatches 44; Indels 66; Gaps 11; 

:gsllaalsfhlealaekpakvqiq LEKWLDGDVGIENKVEAARTLEDFKAAYK 68 

: : I I : : | | | : : | I : I : I : III I : I I I 
r QNVLVETGFQTKSLLEKISYLQNQARSKNIEIIYVQ HIEN — SEAQTSED 58 

WQL IDQKKGFILFR KQ VDDISPLS 93 

III : I I I : I: IE II :: 



II : : I : : I I I I : I I I : : I I I I I I I : : : 

GYQLIVPEGAVTTFDG-DDIPAETINEFYE-DI--WEERFADVL DYK 163 



Qy 


14 


Db 


10 


Qy 


69 


Db 


59 


Qy 


94 


Db 


118 


Qy 


154 


Db 


164 



RESULT 8 

US-09-107-433-3096 

; Sequence 3096, Application US/09107433 
; Patent No. 6800744 

GENERAL INFORMATION: 
; APPLICANT: Lynn A Doucette-Stamm and David Bush 

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID 

SEQUENCES RELATING TO STREPTOCOCCUS PNEUMONIAE FOR DIAG 
THERAPEUTICS 
NUMBER OF SEQUENCES: 520 6 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: GENOME THERAPEUTICS CORPORATION 
STREET: 100 Beaver Street 
CITY: Waltham 
; STATE: Massachusetts 

COUNTRY: USA 
ZIP: 02354 
COMPUTER READABLE FORM: 

MEDIUM TYPE: CD/ROM ISO9660 
COMPUTER: 
OPERATING SYSTEM: 
SOFTWARE: 
CURRENT APPLICATION DATA: 
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APPLICATION NUMBER: US/09/107,433 

FILING DATE: 30-Jun-1998 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 60/ 085131 

FILING DATE: May 12, 1998 

APPLICATION NUMBER: 60/051553 

FILING DATE: July 2, 1997 
ATTORNEY /AGENT INFORMATION: 

NAME: Ariniello, Pamela Deneke 

REGISTRATION NUMBER: 40,489 

REFERENCE/DOCKET NUMBER: GTC-011 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (781)893-5007 

TELEFAX: (781)893-8277 
INFORMATION FOR SEQ ID NO: 3096: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 166 amino acids 

/ TYPE: amino acid 

TOPOLOGY: linear 
MOLECULE TYPE: protein 
HYPOTHETICAL: YES 
ORIGINAL SOURCE: 
/ ORGANISM: Streptococcus pneumoniae 

FEATURE : 

NAME/KEY: misc_feature 

LOCATION: (B) LOCATION 1...166 
SEQUENCE DESCRIPTION: SEQ ID NO: 3096: 
US-09-107-433-3096 

Query Match 9.1%; Score 80; DB 2; Length 166; 

Best Local Similarity 24.7%; Pred. No. 0.42; 

Matches 45; Conservative 27; Mismatches 44; Indels 66; Gaps 11; 

Qy 14 IGSLLAALSFHLEALAEKPAKVQIQ LEKVYLDGDVGIENKVEAARTLEDFKAAYK 68 

: : : I I : : I I I : : I I : I : I : III hill 
Db 10 VQNVLVETGFQTKSLLEKISYLQNQARSKNIEIIYVQ HIEN--SEAQTSED 58 

Qy 69 GWQL IDQKKGFILFR KQ VDDISPLS 93 

III : | | | : | : I I I I : : 

Db 59 -WQLSALLNRKPAEKVFQKKYNS I FKETGLKEYLDKQGI EKLVLCGMQTEYCVDTSVKVA 117 

Qy 94 KTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYRTKKEFE 153 

II : : I : : I I I I : I I I : : I I I I I I I :: : 

Db 118 FEYGYQLIVPEGAVTTFDG-DDIPAETINEFYE-DI- -WEERFADVL DYK 163 

Qy 154 HV 155 

I : 

Db 164 HI 165 



RESULT 9 

US-09-315-794-42 

; Sequence 42, Application US/09315794 

; Patent No. 6197517 

; GENERAL INFORMATION: 

; APPLICANT: Roberts, Christopher J. 

; TITLE OF INVENTION: ESSENTIAL GENES OF YEAST AS TARGETS FOR ANTIFUNGAL 

TITLE OF INVENTION: AGENTS, HERBICIDES, INSECTICIDES AND ANTI-PROLI FERATION 
; TITLE OF INVENTION: DRUGS 
; FILE REFERENCE: 9301-053 

; CURRENT APPLICATION NUMBER: US/09/315,7 94 
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CURRENT FILING DATE: 1999-05-21 
NUMBER OF SEQ ID NOS : 64 
SOFTWARE: Patentln Ver. 2.0 
SEQ ID NO 4 2 
LENGTH: 3 48 
TYPE: PRT 

ORGANISM: Saccharomyces cerevisiae 
US-09-315-794-42 

Query Match 9.0%; Score 79; DB 2; Length 348; 

Best Local Similarity 24.1%; Pred. No. 1.6; 

Matches 42; Conservative 33; Mismatches 67; Indels 32; Gaps 8; 

Qy 5 SKFKIGLLLIGSLLAALS FHLEALAEKPAKVQIQLEKVYL DGDVG I EN - KVEAAR 58 

1 :: I I I I I I : : I : i : : : : I I I : : I i I I 

Db 2 SQYKTGLLLIHPAVTTTPELVENTKAQAASKKVKFVDQFLINKLNDGSITLENAKYETVH 61 

Qy 59 TLEDFKAAYKGWQLIDQKKGFILFRKQVDDISPLS KTNG-YIGVTE DGVISTF 110 

I : : III:: III: III I I : : : I : I : I 
Db 62 YLTPEAQTDIKFPKKL--ISVLADSLKPNGSLIGLSDIYKVDALINGF 107 

Qy HI HGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYRTKKEFEHVIEAVKSSGS 164 

I : : I I : : : I : I : : • I : I I I : : I : I 

Db 108 E IINEPDYCWIKMDSSKLNQTVSIPLKKKKTNNTKLQSGSKLPTFKKASS 157 



RESULT 10 
US-09-389-341-42 

; Sequence 42, Application US/09389341 

; Patent No. 6200803 

; GENERAL INFORMATION: 

; APPLICANT: Roberts, Christopher J. 

; TITLE OF INVENTION: ESSENTIAL GENES OF YEAST AS TARGETS FOR ANTIFUNGAL 

; TITLE OF INVENTION: AGENTS, HERBICIDES, INSECTICIDES AND ANT I -PROLIFERATIVE 

; TITLE OF INVENTION: DRUGS 

; FILE REFERENCE: 9301-057 

; CURRENT APPLICATION NUMBER: US/09/389,341 

; CURRENT FILING DATE: 1999-09-02 

; EARLIER APPLICATION NUMBER: 09/315,794 

; EARLIER FILING DATE: 1999-05-21 

; NUMBER OF SEQ ID NOS: 72 

; SOFTWARE: Patentln Ver . 2.0 

; SEQ ID NO 42 

LENGTH: 3 48 

TYPE: PRT 

ORGANISM: Saccharomyces cerevisiae 
US-09-389-341-42 

Query Match 9.0%; Score 79; DB 2; Length 348; 

Best Local Similarity 24.1%; Pred. No. 1.6; 

Matches 42; Conservative 33; Mismatches 67; Indels 32; Gaps . 8; 

Qy 5 SKFKIGLLLIGSLLAALS FHLEALAEKPAKVQIQLEKVYL DGDVG I EN -KVEAAR 58 

|::| Mill : :| : I ::: :| II : :M I I 

Db 2 SQYKTGLLLIHPAVTTTPELVENTKAQAASKKVKFVDQFLINKLNDGSITLENAKYETVH 61 

Qy 59 TLEDFKAAYKGWQLIDQKKGFILFRKQVDDISPLS — KTNG-YIGVTE DGVISTF 110 

I : : Ml:: III: III II::: I :l: I 
Db 62 YLTPEAQTDIKFPKKL— ISVLADSLKPNGSLIGLSDIYKVDALINGF 107 

Qy 111 HGRPGILSEPIQSFFQIDIKRLESRMADDLRKGIPYRTKKEFEHVIEAVKSSGS 164 
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I : : I I : : : I : I : : I : I I I : : I : I 

Db 108 E IINEPDYCWIKMDSSKLNQTVSIPLKKKKTNNTKLQSGSKLPTFKKASS 157 

RESULT 11 

US-09-605-703B-1144 

Sequence 1144, Application US/09605703B 
Patent No. 6962989 
GENERAL INFORMATION : 
APPLICANT: Pompejus, Markus 
APPLICANT: Kroger, Burkhard 
APPLICANT: Schroder, Hartwig 
APPLICANT: Zelder, Oskar 
APPLICANT: Haberhauer, Gregor 

TITLE OF INVENTION: CORYNE BACTERIUM GLUTAMICUM GENES ENCODING NOVEL 
TITLE OF INVENTION: PROTEINS 
FILE REFERENCE: BGI-129CP 

CURRENT APPLICATION NUMBER: US/09/605, 703B 
CURRENT FILING DATE: 2000-06-27 
PRIOR APPLICATION NUMBER: 60/142,7 64 
PRIOR FILING DATE: 1999-07-08 
PRIOR APPLICATION NUMBER: 60/152,318 
PRIOR FILING DATE: 1999-09-03 
NUMBER OF SEQ ID NOS : 2 934 
SEQ ID NO 1144 
LENGTH: 510 
TYPE: PRT 

ORGANISM: Corynebacterium glutamicum 
US-09-605-703B-1144 

Query Match 8.9%; Score 77.5; DB 2; Length 510; 

Best Local Similarity 21.4%; Pred. No. 4.3; 

Matches 43; Conservative 29; Mismatches 76; Indels 53; Gaps 9; 

Qy 12 LLIGSLLAALS FHLEALAEKPAKVQ IQLEKVYLDGDVGIENKVEAARTLEDFKAAY 67 

II: I hi I 1:1 :|: : : : I : I : : I I I I : 
Db 9 LLVAPLT AS LV FCNLAVAANAVE VEAE S P WI NE VE SNS D - PVGDWVE LANT DNNNS I D I 67 

Qy 68 KGWQLIDQKK GFILFRKQVDDISPLSKTNG YIGVTEDGVI 107 

I I I : I I : I : : I I : I I I I : : I : 

Db 68 SGWSLVDDKEDLENALVLPEGTEIESGGYFVIYTDSADYVPTNNTFGGQEYFGLGKDDTV 127 

Qy 108 STFHGRPGILSEPIQSFFQIDI KRLESRMADDLRKGIPYRTKKEFEHVIEAV 159 

I | | : :: | : | : | I : I I I 
Db 128 -TLRNAEG EWATYSWKDLGEHAENTYGRIPDMTGDFANTGVPTPGAKNV AA 178 

Qy 160 KSSGSQ HHVE 169 

: I I : I : I I 

Db 17 9 EGS GEEEGWANAQL P FHNVE 199 

RESULT 12 

US-09-605-703B-1142 

Sequence 1142, Application US/09605703B 
Patent No. 6962989 
GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Pompejus, Markus 
Kroger, Burkhard 
Schroder, Hartwig 
Zelder, Oskar 
Haberhauer, Gregor 
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/ TITLE OF INVENTION: CORYNEBACTERIUM GLUTAMICUM GENES ENCODING NOVEL 
; TITLE OF INVENTION: PROTEINS 
; FILE REFERENCE: BGI-129CP 

/ CURRENT APPLICATION NUMBER: US/09/ 605 , 703B 

; CURRENT FILING DATE: 2000-06-27 

; PRIOR APPLICATION NUMBER: 60/142,764 

; PRIOR FILING DATE: 1999-07-08 

; PRIOR APPLICATION NUMBER: 60/152,318 

; PRIOR FILING DATE: 1999-09-03 

/ NUMBER OF SEQ ID NOS : 2934 

/ SEQ ID NO 1142 

LENGTH: 534 

TYPE: PRT 

ORGANISM: Corynebacterium glutamicum 
US-09-605-703B-1142 

Query Match 8.9%; Score 77.5; DB 2; Length 534; 

Best Local Similarity 21.4%; Pred. No. 4.6; 

Matches 43; Conservative 29; Mismatches 76; Indels 53; Gaps 9; 

Qy 12 LLIGSLLAALS FHLEALAEKPAKVQ IQLEKVYLDGDVGIENKVEAARTLEDFKAAY 67 

II: I I : I I I : I : I : : : : I : I : : I I I I : 
Db 9 L LVAP LT AS LV FC NLAVAANAVE VEAE S P WI NE VE S N S D - PVGDWVE LAN T DNNN S I D I 67 

Qy 68 KGWQLIDQKK GFILFRKQVDDIS PLSKTNG YIGVTEDGVI 107 

I I I : I I : I : : I I : I I I I : : I : 

Db 68 SGWSLVDDKEDLENALVLPEGTEIESGGYFVIYTDSADYVPTNNTFGGQEYFGLGKDDTV 127 

Qy 108 STFHGRPGILSEPIQSFFQIDI KRLESRMADDLRKGIPYRTKKE FEHVIEAV 15 9 

I | | : : : | : | : | I : I I I 
Db 128 -TLRNAEG EWATYSWKDLGEHAENTYGRIPDMTGDFANTGVPTPGAKNV AA 178 

Qy 160 KSSGSQ HHVE 169 

: I I : I : I I 

Db 179 EGSGEEEGWANAQLPFHNVE 19 9 



RESULT 13 

US-09-732-210-1631 

Sequence 1631, Application US/09732210 
Patent No. 6573361 
GENERAL INFORMATION: 
APPLICANT: Bunkers, Greg J. 
APPLICANT: Liang, Jihong 
APPLICANT: Mittanck, Cindy A. 
APPLICANT: Seale, Jeffrey W. 
APPLICANT: Wu, Yonnie S. 

TITLE OF INVENTION: Anti-fungal Proteins and Methods for Their Use 
FILE REFERENCE: 38-21 ( 15036 ) B 
CURRENT APPLICATION NUMBER: US/09/732,210 
CURRENT FILING DATE: 2000-12-07 
PRIOR APPLICATION NUMBER: US 60/169,513 
PRIOR FILING DATE: 1999-12-07 
PRIOR APPLICATION NUMBER: US 60/169,340 
PRIOR FILING DATE: 1999-12-07 
NUMBER OF SEQ ID NOS: 1753 
SEQ ID NO 1631 
LENGTH: 155 
TYPE: PRT 

ORGANISM: Helicobacter pylori 
US-09-732-210-1631 
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Query Match 8.7%; Score 76.5; DB 2; Length 155; 

Best Local Similarity 24.5%; Pred. No. 0.96; 

Matches 34; Conservative 21; Mismatches 53; Indels 31; Gaps 6; 

Qy 39 LEKVYLDGDVGIENKVEAARTLEDFKAAYKGWQLIDQK KGFILFRKQVDDISPL 92 

: I : I I : I : I I : I : : I I I : I I : : : I I 

Db 2 6 INKMMFDGKKSVAEKI IYKAFNKIEEKSGEKGIEVFEKALERVRPLVEV 74 

Qy 93 -SKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQIDIKRLESRMADDLR K 142 

I : I 1 I : I III: : I I I I I I I 

Db 75 RSRRVG— GATYQVPVEVRASRQQSLS--IRWILEATRKRNERMMVDRLANELMDAASDK 130 

Qy 143 GIPYRTKKEFE H V I E AVKS 161 

I : : I : : : I I I : 
Db 131 GAAFKKKEDVHKMAEANKA 14 9 



RESULT 14 

US-09-134-000C-3598 

Sequence 3598, Application US/09134000C 
Patent No. 6617156 
GENERAL INFORMATION: 
APPLICANT: Lynn Doucette-Stamm et al 

TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO 
TITLE OF INVENTION: ENTEROCOCCUS FAECAL IS FOR DIAGNOSTICS AND THERAPEUTICS 
FILE REFERENCE: 032796-032 

CURRENT APPLICATION NUMBER: US/09/134, 000C 
CURRENT FILING DATE: 1998-08-13 
PRIOR APPLICATION NUMBER: US 60/055,778 
PRIOR FILING DATE: 1997-08-15 
NUMBER OF SEQ ID NOS : 6812 
SOFTWARE: Patentln version 3.1 
SEQ ID NO 3598 
LENGTH: 4 63 
TYPE: PRT 

ORGANISM: Enterococcus faecalis 
FEATURE : 

NAME/KEY: MISC_FEATURE 
LOCATION: (262) . . (262) 

OTHER INFORMATION: Amino acid 262 is Xaa wherein Xaa = any amino acid. 
US-09-134-000C-3598 

Query Match 8.7%; Score 76.5; DB 2; Length 463; 

Best Local Similarity 27.2%; Pred. No. 4.8; 

Matches 41; Conservative 24; Mismatches 67; Indels 19; Gaps 7; 

Qy 3 SRSKFKIGLLLIGSLLAALSFHLEALAEKPAKVQIQLEKVYLDGDVGIENKVEA ART 5 9 

I I : I : : : I : I I : I : : : : I I : I I I : I : : I II 

Db 26 SNRKYIIAVTACPTGIAHTYMAEDALKKKAKEMGVDI-KVETNGSEGIKNRLTAEDIARA 84 

Qy 60 LED FKAA Y KGWQ L — I DQKKGFI L FRKQ VDDI S PLS KTNG YIGVTEDGVI S T FHGRPGI L 117 

III:: III I : I I : I I I : I I I I I 
Db 85 DGVIVAADKKVEMNRFDGKK LVNRPVSD — GIRKTEELINLAISGEAPTFHGSDSAA 139 

Qy 118 SEPIQSFFQIDIKR--LESRMADDLRKGIPY 146 

I : II : | | : | | I : : 

Db 140 SD QEDSAEGSIGSRIYKDLMNGVSH 164 



RESULT 15 
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PCT-US95-17026-2 

Sequence 2, Application PC/TUS951702 6 
GENERAL INFORMATION : 

APPLICANT: ZymoGenetics , Inc. 
APPLICANT: 1201 Eastlake Avenue East 
APPLICANT: Seattle 
APPLICANT : WA 
APPLICANT : USA 
APPLICANT: 98102 

TITLE OF INVENTION: Bovine Factor XIII 
NUMBER OF SEQUENCES : 13 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: ZymoGenetics , Inc. 
STREET: 1201 Eastlake Avenue East 
CITY: Seattle 
STATE : WA 
COUNTRY : USA 
ZIP: 98102 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.25 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: PCT/US95/1702 6 
FILING DATE: 
CLASSIFICATION: 
ATTORNEY /AGENT INFORMATION: 
NAME: Parker, Gary E 
REGISTRATION NUMBER: 31-648 
REFERENCE/DOCKET NUMBER: 94-18 PC 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 20 6-442-6673 
TELEFAX: 206-442-6678 
INFORMATION FOR SEQ ID NO: 2: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 7 32 amino acids 
TYPE: amino acid 
TOPOLOGY: linear 
MOLECULE TYPE: protein 
PCT-US95-17026-2 

Query Match 8.7%; Score 76.5/ DB 5; Length 732; 

Best Local Similarity 20.6%; Pred. No. 9.5; 

Matches 32; Conservative 25; Mismatches 47; Indels 51; Gaps 6; 

Qy 32 PAKV QIQLEKVYLDGDVGIENKV EAARTLEDFKAAY 67 

11:1 : | | : : : | : | : : | : II I I : 

Db 332 PARWTNYFSAHDNDANLQLD- 1 FLEEDGNVNSKLTKDS VWNYHCWNEAWMTRPDLPVGF 390 

Qy 68 KGWQLIDQKKGFILFRKQVDDISPLSKTNGYIGVTEDGVISTFHGRPGILSEPIQSFFQI 127 

I I I : : I : I : : I I : I I II 
Db 391 GGWQWDS TPQENSDGMYRCGPASVQAIKHG HVCFQF 427 

Qy 128 DIKRLESRMADDLRKGIPYRTKKE FEHVIEAVKSS 162 

I : : : I I : | | : | I : I I : : : 
Db 428 DAP FVFAE VN SDL VYVTAKKDG TH WEALD TT 459 



Search completed: June 2, 2006, 20:28:39 
Job time : 50 sees 
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SCORE 1.3 BuildDate: 12/06/2005 
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GenCore version 5.1.9 

Copyright (c) 1993 - 2006 Biocceleration Ltd. 



OM protein - protein search, using sw model 

2, 2006, 20:25:48 



Run on: 



Title: 

Perfect score: 
Sequence : 

Scoring table: 



June 



US-10-510-408-2 
875 

1 MYSRSKFKIGLLLIGSLLAA. 



Search time 85 Seconds 
(without alignments) 
930.570 Million cell updates/sec 



. HVIEAVKSSGSQHHVEDMKT 173 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 2589679 seqs, 457216429 residues 

Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



2589679 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Database 



A_Geneseq__8 : * 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 



geneseqpl980s 
geneseqpl990s 
geneseqp2000s 
geneseqp2001s 
geneseqp2002s 
geneseqp2003a 
geneseqp2003b 
geneseqp2004s 
geneseqp2005s 
geneseqp2006 



: * 
: * 
: * 
: * 
: * 
s:' 

★ 
* 

s:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 

Match Length 


DB 


ID 


Description 


1 


875 


100.0 


173 


8. 


ADG32154 


Adg32154 


Mutant B_ 


2 


875 


100.0 


173 


8 


ADN60330 


Adn60330 


B. lichen 


3 


98.5 


11.3 


712 


4 


AAE12019 


Aael2019 


Neisseria 


4 


98.5 


11.3 


712 


4 
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Aau27566 
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5 


94 


10.7 
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6 
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6 


92 


10.5 
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3 
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7 


92 


10.5 
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3 
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Aag59649 


Arabidops 


8 


92 


10.5 


638 


3 
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Aag59648 
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9 


91 


10.4 
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2 
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Aayl9947 


B. burgdo 


10 


91 


10.4 
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2 


AAY19946 


Aayl9946 


B. burgdo 


11 


89.5 


10.2 


435 


7 


ADJ70167 


Adj 70167 


Human hea 


12 


87.5 


10.0 


431 


7 


ADJ70823 


Adj70823 


Human hea 
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